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Overview 



The goal of this set of lectures is to combine two seemingly unrelated topics: 

• The study of Boolean functions, a field particularly active in computer science 

• Some models in statistical physics, mostly percolation 

The link between these two fields can be loosely explained as follows: a percolation 
configuration is built out of a collection of i.i.d. "bits" which determines whether the 
corresponding edges, sites, or blocks are present or absent. In that respect, any event 
concerning percolation can be seen as a Boolean function whose input is precisely these 
"bits". 

Over the last 20 years, mainly thanks to the computer science community, a very 
rich structure has emerged concerning the properties of Boolean functions. The first 
part of this course will be devoted to a description of some of the main achievements 
in this field. 

In some sense one can say, although this is an exaggeration, that computer scientists 
are mostly interested in the stability or robustness of Boolean functions. As we will see 
later in this course, the Boolean functions which "encode" large scale properties of 
critical percolation will turn out to be very sensitive to small perturbations. This 
phenomenon corresponds to what we will call noise sensitivity. Hence, the Boolean 
functions one wishes to describe here are in some sense orthogonal to the Boolean 
functions one encounters, ideally, in computer science. Remarkably, it turns out that 
the tools developed by the computer science community to capture the properties and 
stability of Boolean functions are also suitable for the study of noise sensitive functions. 
This is why it is worth us first spending some time on the general properties of Boolean 
functions. 

One of the main tools needed to understand properties of Boolean functions is 
Fourier analysis on the hypercube. Noise sensitivity will correspond to our Boolean 
function being of "high frequency" while stability will correspond to our Boolean func- 
tion being of "low frequency". We will apply these ideas to some other models from 
statistical mechanics as well; namely, first passage percolation and dynamical percola- 
tion. 

Some of the different topics here can be found (in a more condensed form) in |Garllj . 
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Some standard notations 

In the following table, f{n) and g{n) are any sequences of positive real numbers. 



fin) X g{n) 


there exists some constant C > such that 

/H <c, Vn>l 
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fin) < Oigin)) 


there exists some constant C > such that 
fin) < Cgin) , Vn > 1 


fin) > nigin)) 


there exists some constant C > such that 

fin) > Cgin) , Vn > 1 


fin) = oigin)) 
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Chapter I 

Boolean functions and key concepts 



1 Boolean functions 

Definition I.l. A Boolean function is a function from the hypercube Q„ := {—1, 1}" 
into either {—1, 1} or {0, 1}. 

Qn will be endowed with the uniform measure P = = + ^^i)®" and E will 

denote the corresponding expectation. At various times, f2„ will be endowed with the 
general product measure Pp = Pp = {{1 — p)6-i + p^i)®" but in such cases the p will 
be explicit. Kp will then denote the corresponding expectations. 

An element of f2„ will be denoted by either cu or cun and its n bits by a;i, . . . , so 
that uj — {xi, . . . , Xn)- 

Depending on the context, concerning the range, it might be more pleasant to work 
with one of{— l,l}or{0,l} rather than the other and at some specific places in these 
lectures, we will even relax the Boolean constraint (i.e. taking only two possible values). 
In these cases (which will be clearly mentioned), we will consider instead real- valued 
functions / : f2„ — >■ R. 

A Boolean function / is canonically identified with a subset Af of fin via Af := {u : 

m = 1}- 

Remark I.l. Often, Boolean functions are defined on {0, 1}" rather than r2„ = { — 1, 1}". 
This does not make any fundamental difference at all but, as we will see later, the choice 
of { — 1, 1}" turns out to be more convenient when one wishes to apply Fourier analysis 
on the hypercube. 

2 Some Examples 

We begin with a few examples of Boolean functions. Others will appear throughout 
this chapter. 
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CHAPTER I. BOOLEAN FUNCTIONS AND KEY CONCEPTS 



Example 1 (Dictatorship). 



= Xi 



The first bit determines what the outcome is. 



Example 2 (Parity). 



n 



PA.R^(xi, . . . , Xn) '■ 



Hxi 



1=1 



This Boolean function tells whether the number of — I's is even or odd. 

These two examples are in some sense trivial, but they are good to keep in mind 
since in many cases they turn out to be the "extreme cases" for properties concerning 
Boolean functions. 

The next rather simple Boolean function is of interest in social choice theory. 
Example 3 (Majority function). Let n be odd and define 



Following are two further examples which will also arise in our discussions. 

Example 4 (Iterated 3-Majority function). Let n = 3'^ for some integer k. The bits are 
indexed by the leaves of a rooted 3-ary tree (so the root has degree 3, the leaves have 
degree 1 and all others have degree 4) with depth k. One iteratively applies the previous 
example (with n — 3) to obtain values at the vertices at level k — 1, then level k — 2, etc. 
until the root is assigned a vahic. The root's value is then the output of /. For example 
when k = 2, /(— 1, 1, 1; 1, — 1, — 1; — 1, 1, — 1) = —1. The recursive structure of this 
Boolean function will enable explicit computations for various properties of interest. 

Example 5 (Clique containment). If r = (g) for some integer n, then Q,. can be 
identified with the set of labelled graphs on n vertices, {xi is 1 iff the ith edge is 
present.) Recall that a clique of size A; of a graph G = {V,E) is a complete graph on 
k vertices embedded in G. 

Now for any 1 < k < = r, let CLIQ^ be the indicator function of the event 
that the random graph G^^ defined by a; G $7^ contains a clique of size k. Choosing 
k = kn so that this Boolean function is non-degenerate turns out to be a rather delicate 
issue. The interesting regime is near kn ~ 21og2(n). See the exercises for this "tuning" 
oi k = kn- It turns out that for most values of n, the Boolean function CLIQ^^ is 
degenerate (i.e. has small variance) for all values of k. However, there is a sequence of 
n for which there is some k — kn for which CLIQ^ is nondegerate. 



n 




i=l 



3. PIVOTALITY AND INFLUENCE 11 

3 Pivotality and Influence 

This section contains our first fundamental concepts. We will abbreviate {1, . . . ,n} by 
[n]. 

Definition 1.2. Given a Boolean function f from Qn into either { — 1, 1} or {0, 1} and 
a variable i G [n], we say that i is pivotal for / for u if {f{uj) ^ /(w*)} where is 
u but flipped in the ith coordinate. Note that this event is measurable with respect to 

Definition 1.3. The pivotal set, V, for f is the random set of [n] given by 

V{uj) = Vfiu) := {i G [n] : i is pivotal for f for u}. 

In words, it is the (random) set of bits with the property that if you flip the bit, 
then the function output changes. 

Definition 1.4. The infiuence of the ith bit, !«(/), is defined by 

:= P( i IS pivotal for f )= P(i G V). 
Let also the infiuence vector, Inf(/), be the collection of all the influences: i.e. 

{Ii(/)}i6[n]- 

In words, the influence of the ith bit, !«(/), is the probability that, on flipping this 
bit, the function output changes. 

Definition 1.5. The total infiuence, I(/), is defined by 

!(/):= =||Inf(/)||i(=E(|P|)). 

i 

It would now be instructive to go and compute these quantities for examples [l]-[3| 
See the exercises. 

Later, we will need the last two concepts in the context when our probability measure 
is Pp instead. We give the corresponding definitions. 

Definition 1.6. The infiuence vector at level p, {If(/)}ie[n], is defined by 

lP{f) ■= Fp{ i IS pivotal for f )= Pp(i G V). 
Definition 1.7. The total infiuence at level p, is defined by 
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It turns out that the total influence has a geometric-combinatorial interpretation as 
the size of the so-called edge-boundary of the corresponding subset of the hypercube. 
See the exercises. 

Remark 1.2. Aside from its natural definition as well as its geometric interpretation as 
measuring the edge-boundary of the corresponding subset of the hypercube (see the 
exercises), the notion of total influence arises very naturally when one studies sharp 
thresholds for monotone functions 



as we will see in detail in Chapter 



III 



to be defined in Chapter III). Roughly speaking, 
for a monotone event A, one has that dPp [A~\ / dp 
is the total influence at level p (this is the Margulis-Russo formula). This tells us that 
the speed at which one changes from the event A "almost surely" not occurring to the 
case where it "almost surely" does occur is very sudden if the Boolean function happens 
to have a large total influence. 



4 The Kahn, Kalai, Linial theorem 

This section addresses the following question. Does there always exist some variable 
i with (reasonably) large influence? In other words, for large n, what is the smallest 
value (as we vary over Boolean functions) that the largest influence (as we vary over 
the different variables) can take on? 

Since for the constant function all influences are 0, and the function which is 1 only if 
all the bits are 1 has all influences l/2"~^, clearly one wants to deal with functions which 
are reasonably balanced (meaning having variances not so close to 0) or alternatively, 
obtain lower bounds on the maximal influence in terms of the variance of the Boolean 
function. 

The first result in this direction is the following result. A sketch of the proof is given 
in the exercises. 

Theorem I.l (Discrete Poincare). /// is a Boolean function mapping Qn into { — 1, 1}, 
then 

Varif) <J2Uf). 

i 

R follows that there exists some i such that 

Uf) > Var(/)/n. 

This gives a first answer to our question. For reasonably balanced functions, there is 
some variable whose influence is at least of order 1/n. Can we find a better "universal" 
lower bound on the maximal influence? Note that for Example [3] all the influences are 
of order l/\/n (and the variance is 1). In terms of our question, this universal lower 
bound one is looking for should lie somewhere between 1/n and 1 / i/ra. The following 
celebrated result improves by a logarithmic factor on the above Q{l/n) bound. 



4. THE KAHN, KALAI, LINIAL THEOREM 
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Theorem 1.2 ( |KKL88] ). There exists a universal c > such that if f is a Boolean 
function mapping f2„ into {0, 1}, then there exists some i such that 

I^(/)>cVar(/)(logn)/n. 

What is remarkable about this theorem is that this "logarithmic" lower bound on 
the maximal influence turns out to be sharpl This is shown by the following example 
by Ben-Or and Linial. 

Example 6 (Tribes). Partition [n] into subsequent blocks of length log2(?T-)— log2(log2(?T-)) 
with perhaps some leftover debris. Deflne / = to be 1 if there exists at least one 
block which contains all I's, and otherwise. 

It turns out that one can check that the sequence of variances stays bounded away 
from and that all the influences (including of course those belonging to the debris 
which are equal to 0) are smaller than c(logn)/n for some c < oo. See the exercises for 
this. Hence the above theorem is indeed sharp. 

Our next result tells us that if all the influences are "small" , then the total influence 
is large. 

Theorem 1.3 ( |KKL88] ). There exists a c > such that if f is a Boolean function 
mapping Qn into {0, 1} and 6 := maxjlj(/) then 

I(/)>cVar(/)log(l/5). 

Or equivalently, 

||I„f(/)|K>cVM/)log^,_^. 

One can in fact talk about the influence of a set of variables rather than the influence 
of a single variable. 

Definition 1.8. Given S C [n], the influence of S, ls{f)> is defined by 

^s{f) := P( / is not determined by the bits in 5*^). 

It is easy to see that when S* is a single bit, this corresponds to our previous defl- 
nition. The following is also proved in [KKL88j . We will not indicate the proof of this 
result in these lecture notes. 

Theorem 1.4 ( |KKL88] ). Given a sequence fn of Boolean functions mapping Qn into 
{0, 1} such that < inf„E„(/) < sup„E„(/) < 1 and any sequence an going to oo arbi- 
trarily slowly, then there exists a sequence of sets Sn C [n] such that \Sn\ < a^ri/logn 
and IsSfn) —^1 as n —i- oo. 



Theorems |I.2| and 1.3 will be proved in Chapter [V| 
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5 Noise sensitivity and noise stability 

This section introduces our second set of fundamental concepts. 

Let u be uniformly chosen from Qn and let be u but with each bit independently 
"rerandomized" with probability e. This means that each bit, independently of every- 
thing else, rechooses whether it is 1 or —1, each with probability 1/2. Note that 
then has the same distribution as u. 

The following definition is central for these lecture notes. Let m„ be an increasing 
sequence of integers and let /„ : Qm„ — !'{±l}or{0,l}. 

Definition 1.9. The sequence {/«} is noise sensitive if for every e > 0, 

lim E[fr,{u)fn{u,)] - E[/„M]2 = 0. (LI) 

n— ^-oo 

Since fn just takes 2 values, this says that the random variables /„(a;) and fn{<^e) 



are asymptotically independent for e > fixed and n large. We will see later that (I.l) 
holds for one value of e G (0, 1) if and only if it holds for all such e. The following 
notion captures the opposite situation where the two events above are close to being 
the same event if e is small, uniformly in n. 

Definition 1. 10. The sequence {fn} is noise stable if 

limsupP(/„(w) = 0. 

It is an easy exercise to check that a sequence {/„} is both noise sensitive and noise 
stable if and only it is degenerate in the sense that the sequence of variances {Var(/„)} 
goes to 0. Note also that a sequence of Boolean functions could be neither noise sensitive 
nor noise stable (see the exercises). 

It is also an easy exercise to check that Example [l] (dictator) is noise stable and 
Example [2] (parity) is noise sensitive. We will see later, when Fourier analysis is brought 
into the picture, that these examples are the two opposite extreme cases. For the other 

6l 



examples, it turns out that Example |3] (Majority) is noise stable, while Examples |4 
are all noise sensitive. See the exercises. In fact, there is a deep theorem (see |MOO10] ) 



which says in some sense that, among all low influence Boolean functions. Example [3] 
(Majority) is the stablest. 



In Figure I.l, we give a slightly impressionistic view of what "noise sensitivity" is. 



6 The Benjamini, Kalai and Schramm noise sensi- 
tivity theorem 



The following is the main theorem concerning noise sensitivity. 



6. THE BENJAMIN!, KALAI AND SCHRAMM NOISE SENSITIVITY THE0REM15 







Figure I.l: Let us consider the following "experiment": take a bounded domain in the 
plane, say a rectangle, and consider a measurable subset A of this domain. What would 
be an analogue of the above definitions of being noise sensitive or noise stable in this 
case? Start by sampling a point x uniformly in the domain according to Lebesgue 
measure. Then let us apply some noise to this position x so that we end up with a new 
position x^. One can think of many natural "noising" procedures here. For example, 
let be a uniform point in the ball of radius e around x, conditioned to remain in the 
domain. (This is not quite perfect yet since this procedure does not exactly preserve 
Lebesgue measure, but let's not worry about this.) The natural analogue of the above 
definitions is to ask whether 1^(2;) and Ia^x^) are decorrelated or not. 
Question: According to this analogy, discuss the stability versus sensitivity of the sets 
A sketched in pictures (a) to (d) ? Note that in order to match with definitions 1.9 and 



1. 10, one should consider sequences of subsets {A^} instead, since noise sensitivity is 



an asymptotic notion. 
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Theorem 1.5 ( |BKS99] ). // 



then {fn} is noise sensitive. 



limVlfc(/O' = 0, 

rj, ' * 



Remark 1.3. The converse is clearly false as shown by Example [2j However, it turns 
out that the converse is true for so-called monotone functions (see the next chapter 



for the definition of this) as we will see in Chapter IV 



This theorem will allow us to conclude noise sensitivity of many of the examples 
we have introduced in this first chapter. See the exercises. This theorem will also be 
proved in Chapter |V| 



7 Percolation crossings: our final and most impor- 
tant example 

We have saved our most important example to the end. This set of notes would not be 
being written if it were not for this example and for the results that have been proved 
for it. 

Let us consider percolation on at the critical point Pc(^^) = 1/2. (See Chapter 
[njfor a fast review on the model.) At this critical point, there is no infinite cluster, but 
somehow clusters are 'large' (there are clusters at all scales). This can be seen using 
duality or with the RSW Theorem II. 1 In order to understand the geometry of the 



critical picture, the following large-scale observables turn out to be very useful: Let f2 
be a piecewise smooth domain with two disjoint open arcs di and 82 on its boundary 
dQ. For each > 1, we consider the scaled domain nQ. Let An be the event that there 
is an open path in u from ndi to nd2 which stays inside nfl. Such events are called 
crossing events. They are naturally associated with Boolean functions whose entries 
are indexed by the set of edges inside nfl (there are 0{n^) such variables). 
For simplicity, let us consider the particular case of rectangle crossings: 



Example 7 (Percolation crossings). 



h • n 



Let a, 6 > and let us consider the rect- 
angle [0, a ■ n] X [0,6 ■ n]. The left to 
right crossing event corresponds to the 
Boolean function /„ : {-1, l}'^(i)"^ -)■ 
{0, 1} defined as follows: 



/n(w) 



a ■ n 



if there is a left- 
right crossing 
otherwise 
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We will later prove that this sequence of Boolean functions {fn} is noise sensitive. 
This means that if a percolation configuration cu ~ Pp^=i/2 is given to us, one cannot 
predict anything about the large scale clusters of the slightly perturbed percolation 
configuration ou^ (where only an e-fraction of the edges have been resampled). 

Remark 1.4. The same statement holds for the above more general crossing events (i.e. 
in (nJl, ndi, nd2))- 
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Exercise sheet of Chapter 11 



Exercise I.l. Determine the pivotal set, the influence vector and the total influence 
for Examples ll]-[3j 



Exercise 1.2. Determine the influence vector for iterated 3-majority and tribes. 

Exercise 1.3. Show that in Example 6 (tribes) the variances stay bounded away from 
0. If the blocks are taken to be of size logg n instead, show that the influences would 
all be of order 1/n. Why does this not contradict the KKL Theorem? 

Exercise 1.4. fi„ has a graph structure where two elements are neighbors if they differ 
in exactly one location. The edge boundary of a subset A C f2„, denoted by dE{A), 
is the set of edges where exactly one of the endpoints is in A. 
Show that for any Boolean function, I(/) = \dE{Af)\/2'^'~^ . 



Exercise 1.5. Prove Theorem I.l This is a type of Poincare inequality. Hint: use the 
fact that Var(/) can be written 2P[/(ci;) 7^ /(^)], where a;, a; are independent and try 
to "interpolate" from uoiou. 



Exercise 1.6. Show that Example |3] (Majority) is noise stable. 

Exercise 1.7. Prove that Example |4] (iterated 3-majority) is noise sensitive directly 



without relying on Theorem 1.5 Hint: use the recursive structure of this example in 



order to show that the criterion of noise sensitivity is satisfied. 

Exercise 1.8. Prove that Example [6] (tribes) is noise sensitive directly without using 



Theorem L5 Here there is no recursive structure, so a more "probabilistic" argument 
is needed. 

Problem 1.9. Recall Example [5] (clique containment). 

(a) Prove that when fc„ = o(n^/^), CLIQ^" is asymptotically noise sensitive. Hint: 
start by obtaining an upper bound on the influences (which are identical for each 



edge) using Exercise 1.4 Conclude by using Theorem 1.5 



(b) Open exercise: Find a more direct proof of this fact (in the spirit of exercise 1.8) 



which would avoid using Theorem 1.5 
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As pointed out after Example for most values of k = kn, the Boolean function 
CLIQ^" becomes degenerate. The purpose of the rest of this problem is to determine 
what the interesting regime is where CLIQ,^" has a chance of being non-degenerate (i.e. 
variance bounded away from 0). The rest of this exercise is somewhat tangential to the 
course. 

(c) If 1 < A; < (2) = what is the expected number of chques in G^, w G fir ? 

(d) Explain why there should be at most one choice of = fc„ such that the variance of 
CLIQ^" remains bounded away from ? (No rigorous proof required.) Describe 
this choice of kn- Check that it is indeed in the regime 21og2(^). 

(e) Note retrospectively that in fact, for any choice of k = kn, CLIQ^" is noise 
sensitive. 



Exercise 1. 10. Deduce from Theorem 1.5 that both Example (iterated 3-majority) 



and Example |6] (tribes) are noise sensitive. 

Exercise I.ll. Give a sequence of Boolean functions which is neither noise sensitive 
nor noise stable. 



Exercise 1.12. In the sense of Definition 1.8, show that for the majority function and 
for fixed e, any set of size n^/^"*"*^ has influence approaching 1 while any set of size n^^"^'^ 
has influence approaching 0. 

Exercise 1.13. Show that there exists c > such that for any Boolean function 

l2(/)>cVar2(/)(log2n)/n 
and show that this is sharp up to a constant. This result is also contained in |KKL88] . 



Problem 1.14. Do you think a "generic" Boolean function would be stable or sensitive? 
Justify your intuition. Show that if /„ was a "randomly" chosen function, then a.s. {/„} 
is noise sensitive. 



Chapter II 

Percolation in a nutshell 



In order to make these lecture notes as self-contained as possible, we review various 
aspects of the percolation model and give a short summary of the main useful results. 

For a complete account of percolation, see |Gri99j and for a study of the 2-dimensional 
case, which we are concentrating on here, see the lecture notes |Wer07] . 



1 The model 

Let us briefly start by introducing the model itself. 

We will be concerned mainly with two-dimensional percolation and we will focus 
on two lattices: J? and the triangular lattice T. (All the results stated for 1? in these 
lecture notes are also valid for percolations on "reasonable" 2-d translation invariant 
graphs for which the RSW Theorem (see the next section) is known to hold at the 
corresponding critical point.) 

Let us describe the model on the graph 1? which has T? as its vertex set and 
edges between vertices having Euclidean distance 1. Let denote the set of edges 
of the graph 1? . For any p G [0, 1] we define a random subgraph of 1? as follows: 
independently for each edge e G E^, we keep this edge with probability p and remove it 
with probability 1— p. Equivalently, this corresponds to defining a random configuration 
uj G { — 1, l}'^ where, independently for each edge e G E^, we declare the edge to be 
open {ijj[e) = 1) with probability p or closed (w(e) = —1) with probability 1 — p. The 
law of the so-defined random subgraph (or configuration) is denoted by Pp. 

Percolation is defined similarly on the triangular grid T, except that on this lattice 
we will instead consider site percolation (i.e. here we keep each site with probability 
p). The sites are the points Z + e^'^/^Z so that neighboring sites have distance one from 
each other in the complex plane. 
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CHAPTER II. PERCOLATION IN A NUTSHELL 




L □ 



ffl 



n □ I 



Figure II. 1: Pictures (by Oded Schramm) representing two percolation configurations 
respectively on T and on 7? (both at p = 1/2). The sites of the triangular grid are 
represented by hexagons. 



2 Russo- Seymour- Welsh 

We will often rely on the following celebrated result known as the RSW Theorem. 

Theorem II. 1 (RSW). (see lGri99f ) For percolation on 7? at p = 1/2, one has the 
following property concerning the crossing events. Let a,b> 0. There exists a constant 
c = c(a, b) > 0, such that for any n > 1, if A^ denotes the event that there is a left to 
right crossing in the rectangle ([0, a ■ n] x [0, h ■ n]) fl 7? , then 

C<Pi/2[v4„] < 1-C. 

In other words, this says that the Boolean functions fn defined in Example^of Chapter 
are non-degenerate. 

The same result holds also in the case of site-percolation on T (also at p = 1/2). 

The parameter p = 1/2 plays a very special role 
for the two models under consideration. Indeed, 
there is a natural way to associate to each per- 
colation configuration ~ Pp a dual configu- 
ration ujp* on the so-called dual graph. In the 
case of Z^, its dual graph can be realized as 
+ (1, 1). In the case of the triangular lat- 
tice, T* = T. The figure on the right illustrates 
this duality for percolation on I?. It is easy to 
see that in both cases p* = 1 — p. Hence, at 
p = 1/2, our two models happen to be self-dual. 

This duality has the following very important consequence. For a domain in T with 
two specified boundary arcs, there is a 'left-right' crossing of white hexagons if and only 
if there is no 'top-bottom' crossing of black hexagons. 
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3 Phase transition 



In percolation theory, one is interested in large scale connectivity properties of the 



random configuration to 



In particular, as one raises the level p above a certain 



critical parameter pc{1j^), an infinite cluster (almost surely) emerges. This corresponds 
to the well-known phase transition of percolation. By a famous theorem of Kesten this 
transition takes place at PcC^"^) = |- On the triangular grid, one also has Pc(T) = 1/2. 
The event {0 oo} denotes the event that there exists a self-avoiding path from 
to oo consisting of open edges. 



This phase transition can be measured with 
the density function 6i2{p) := Pp(0 < — )■ oo) 
which encodes important properties of the 
large scale connectivities of the random con- 
figuration uj: it corresponds to the density av- 
eraged over the space I? of the (almost surely 
unique) infinite cluster. The shape of the 
function Ojp is pictured on the right (notice 
the infinite derivative at Pc)- 




4 Conformal invariance at criticality and SLE pro- 
cesses 

It has been conjectured for a long time that percolation should be asymptotically con- 
formally invariant at the critical point. This should be understood in the same way 
as the fact that a Brownian motion (ignoring its time-parametrization) is a confor- 
mally invariant probabilistic object. One way to picture this conformal invariance is 
as follows: consider the 'largest' cluster surrounding in 51? fl D and such that 
Cs n = 0. Now consider some other simply connected domain Vt containing 0. Let 
Cs be the largest cluster surrounding in a critical configuration in 57? fl Vt and such 
that Cs n dVt = 0. Now let be the conformal map from D to f2 such that 0(0) = and 
0'(O) > 0. Even though the random sets 4>{Cs) and Cs do not lie on the same lattice, the 
conformal invariance principle claims that when 5 = o(l), these two random clusters 
are very close in law. 

Over the last decade, two major breakthroughs have enabled a much better under- 
standing of the critical regime of percolation: 

• The invention of the SLE processes by Oded Schramm( |SchOO] ) . 

• The proof of conformal invariance on T by Stanislav Smirnov ( |Smi01] ). 
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The simplest precise statement concerning conformal invariance is the following. 
Let be a bounded simply connected domain of the plane and let A, B, C and D 
be 4 points on the boundary of Q in clockwise order. Scale the hexagonal lattice T 
by 1/n and perform critical percolation on this scaled lattice. Let F{Q,A,B,C,D,n) 
denote the probability that in the 1/n scaled hexagonal lattice there is an open path 
of hexagons in Q going from the boundary of Q between A and B to the boundary of 
Q between C and D. 

Theorem II. 2. (Smirnov, jSmiOlf ) 

(i) For all Vl and A,B,C and D as above, 

P(fi, A, B, C, D, oo) := lim P(fi, A, B, C, D, n) 

n— >oo 

exists and is conformally invariant in the sense that if f is a conformal mapping, then 
P(n, A, B, C, D, oo) = P(/(fi), /(A), /(S), /(C), /(D), oo). 

(a) If Q is an equilateral triangle (with side lengths 1), A,B and C the three corner 
points and D on the line between C and A having distance x from C , then the above 
limiting probability is x. (Observe, by conformal invariance, that this gives the limiting 
probability for all domains and 4 points.) 

The first half was conjectured by M. Aizenman while J. Cardy conjectured the 
limit for the case of rectangles using the four corners. In this case, the formula is quite 
complicated involving hypergeometric functions but Lennart Carleson realized that this 
is then equivalent to the simpler formula given above in the case of triangles. 

Note that, on at Pc = 1/2, proving the conformal invariance is still a challenging 
open problem. 

We will not define the SLE processes in these notes. See the lecture notes by Vincent 
Beffara and references therein. The illustration below explains how SLE curves arise 
naturally in the percolation picture. 

This celebrated picture (by 
Oded Schramm) represents an 
exploration path on the tri- 
angular lattice. This ex- 
ploration path, which turns 
right when encountering black 
hexagons and left when en- 
countering white ones, asymp- 
totically converges towards 
SLEg (as the mesh size goes to 
0). 
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5 Critical exponents 



The proof of conformal invariance combined with the detailed information given by 
the SLEg process enables one to obtain very precise information on the critical and 
near- critical behavior of site percolation on T. For instance, it is known that on the 
triangular lattice the density function 6j{p) has the following behavior near Pc = 1/2: 

e{p) = (p- 1/2)5/36+0(1)^ 

when p -> 1/2+ (see |Wern7] ). 

In the rest of these lectures, we will often rely on three types of percolation events: 
namely the one-arm, two-arm and four-arm events. They are defined as follows: for 
any radius i? > 1, let A]^ be the event that the site is connected to distance R by 



let Aj^ be the event that there are two 



arms 



of 



some open path (one- arm). Next, 
different colors from the site (which itself can be of either color) to distance R away. 
Finally, let Aj^ be the event that there are four "arms" of alternating color from the site 
(which itself can be of either color) to distance R away (i.e. there are four connected 
paths, two open, two closed from to radius R and the closed paths lie between the 
open paths). See Figure II. 2 for a realization of two of these events. 





Figure II. 2: A realization of the one-arm event is pictured on the left; the four-arm 
event is pictured on the right. 



It was proved in |LSW02] that the probability of the one- arm event decays as follows: 

P[^y := ai(i?) = . 
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For the two-arms and four-arms events, it was proved by Smirnov and Werner in |SW01j 
that these probabihties decay as follows: 

F[Al] := = 

and 

Remark II. 1. Note the o(l) terms in the above statements (which means of course 
goes to zero as i? — ?■ oo). Its presence reveals that the above critical exponents are 
known so far only up to 'logarithmic' corrections. It is conjectured that there are no 
such 'logarithmic' corrections, but at the moment one has to deal with their possible 
existence. More specifically, it is believed that for the one- arm event, 

ai{R) ^R-^ 

where x means that the ratio of the two sides is bounded away from and oo uniformly 
in R; similarly for the other arm events. 

The four exponents we encountered concerning 9j, ai, a2 and (i.e. ^, ^, | and 
|) are known as critical exponents. 

The four-arm event is clearly of particular relevance to us in these lectures. Indeed, 
if a point x is in the 'bulk' of a domain {nQ,ndi,nd2), the event that this point is 
pivotal for the Left-Right crossing event An is intimately related to the four- arm event. 
See Chapter |VT] for more details. 



6 Quasi-multiplicativity 

Finally, let us end this overview by a type of scale invariance property of these arm 
events. More precisely, it is often convenient to "divide" these arm events into different 
scales. For this purpose, we introduce a^^r, R) (with r < i?) to be the probability that 
the four-arm event holds from radius r to radius R {ai{r, R), 02 (r, R) and a^^r, R) are 
defined analogously). By independence on disjoint sets, it is clear that if ri < r2 < 
then one has a^^rijr^) < a4(ri,r2) a4(r2,r3). A very useful property known as quasi- 
multiplicativity claims that up to constants, these two expressions are the same (this 
makes the division into several scales practical). This property can be stated as follows. 

Proposition II. 3 (quasi-multiplicativity, jKes87j ). For any ri < r2 < r^, one has 

(both for 1? and T percolations ) 

aiiri,r3) x a^^ri, ai{r2, rg) . 

See |Wer07t INolOQt ISSlOb] for more details. Note also that the same property holds 
for the one-arm event. However, this is much easier to prove: it is an easy consequence 



of the RSW Theorem II. 1 and the so-called FKG inequality which says that increasing 



events are positively correlated. The reader might consider doing this as an exercise. 



Chapter III 



Sharp thresholds and the critical 
point for 2-d percolation 

1 Monotone functions and the Margulis-Russo for- 
mula 

The class of so-called monotone functions plays a very central role in this subject. 

Definition III.l. A function f is monotone if x < y (meaning Xi < Ui for each i) 
implies that f{x) < f{y). An event is monotone if its indicator function is monotone. 

Recall that when the underlying variables are independent with 1 having probability 
p, we let Pp and Ep denote probabilities and expectations. 

It is fairly obvious that for / monotone, lEp(/) should be increasing in p. The 
Margulis-Russo formula gives us an explicit formula for this (nonnegative) derivative. 

Theorem III.l. Let A he an increasing event in Q„. Then 

i 

Proof. Let us allow each variable Xi to have its own parameter p^ and let Ppi,...,j,„ and 
IEpi,...,p„ be the corresponding probability measure and expectation. It suffices to show 
that 

5(P(P....,P„)(^))/% = I?"-''"H^) 
where the definition of this latter term is clear. WLOG, take i = \. Now 

Pp„...,p„(A) = Pp,,...,p„(^\{i G Va}) + Pp„...,p„(^ n {1 e Va}). 

The event in the first term is measurable with respect to the other variables and hence 
the first term does not depend on px while the second term is 

PiPp„...,p„({lePA}) 

since A n {1 e Va) is the event {xi = 1} n {1 e VaS- □ 
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2 KKL away from the uniform measure case 



Recall now Theorem 1.2 For sharp threshold results, one needs lower bounds on the 
total influence not just at the special parameter 1/2 but at all p. 

The following are the two main results concerning the KKL result for general p that 
we will want to have at our disposal. The proofs of these theorems will be outlined in 
the exercises in Chapter [V] 

Theorem III. 2 ( |BKK"'"92j ). There exists a universal c > such that for any Boolean 
function f mapping Qn into {0, 1} and, for any p, there exists some i such that 

>cVarp(/)(logn)/n 

Theorem III. 3 ( |BKK"'"92j ). There exists a universal c > such that for any Boolean 
function f mapping Qn into {0, 1} and for any p, 

F(/)>cVar,(/)log(l/5p) 

where Sp := maxjlf(/). 



3 Sharp thresholds in general : the Friedgut-Kalai 
Theorem 

Theorem III. 4 ( |FK96j ). There exists a Ci < oo such that for any monotone event A 
on n variables where all the influences are the same, if Pp^ {A) > e, then 

P ciiog(i/(2.)) (A) > 1 - e. 



pi 



log n 



Remark III.l. This says that for fixed e, the probability of A moves from below e to 
above 1 — e in an interval of p of length of order at most 1/ log(n). The assumption of 
equal influences holds for example if the event is invariant under some transitive action, 
which is often the case. For example, it holds for Example |4] (iterated 3-majority) as 
well as for any graph property in the context of the random graphs G{n,p). 



Proof. Theorem |I1I.2| and all the influences being the same tell us that 

F(A) > cmin{Pp(A), 1 - Pp(A)} logn 



for some c > 0. Hence Theorem III.l yields 



d{\og{Fp{A)))/dp > clogn 

if Pp(y4) < 1/2. Letting := + an easy computation (using the fundamental 

theorem of calculus) yields 

log(Pp*(A))>log(l/2). 



4. THE CRITICAL POINT FOR PERCOLATION FOR 1? AND T IS I 



29 



Next, if Pp(A) > 1/2, then 

d{log{l ~Fp{A)))/ dp < -clogn 
from which another apphcation of the fundamental theorem yields 

log(l-Pp,.(A))<-log(l/e) 
where p** := p* + ^^f^^-^- Letting ci = 2/c gives the result. □ 



4 The critical point for percolation for and T is 

1 

2 

Theorem III. 5 r |Kes8n] ). 

Pc(Z^)=Pe(T) = ^. 

Proof. We first show that 0(1/2) = 0. Let Ann(£) := [-3i,3i]\[-i, i] and Ck be 
the event that there is a circuit in Ann (4'^) + 1/2 in the dual lattice around the origin 
consisting of closed edges. The C^'s are independent and RSW and FKG show that for 
some c > 0, Pi/2(C*fc) > c for all k. This gives that Pi/2(C'a; infinitely often) = 1 and 
hence 9(1/2) = 0. 

The next key step is a finite size criterion which implies percolation and which is 
interesting in itself. We outline its proof afterwards. 

Proposition III. 6. (Finite size criterion) Let Jn he the event that there is a crossing 
of a 2n X (n — 2) box. For any p, if there exists an n such that 

Pp(Jn) > .98, 

then a.s. there exists an infinite cluster. 

Assume now that pc = 1/2 + 6 with 5 > 0. Let / = [1/2,1/2 + 5/2]. Since 
0(1/2 + 6/2) = 0, it is easy to see that the maximum influence over all variables 
and over all p & I goes to with n since being pivotal implies the existence of an 
open path from a neighbor of the given edge to distance n/2 away. Next, by RSW, 



inf„Pi/2(^n) > 0. If for all n, Pi/2+5/2(-'n) < -98, then Theorems III.l and III. 3 would 
allow us to conclude that the derivative of fp(Jn) goes to oo uniformly on / as n — )■ cxd, 
giving a contradiction. Hence Pi/2+(5/2(<^n) > -98 for some n implying, by Proposition 



III.6 that 6^(1/2 + 6/2) > 0, a contradiction. □ 
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Outline of proof of Proposition III. 6 , 

The first step is to show that for any p and for any e < .02, if Pp( J„) > 1 — e, then 
Pp(J2n) > 1 — e/2. The idea is that by FKG and "glueing" one can show that one can 
cross a 4n X (n — 2) box with probability at least 1 — 5e and hence one obtains that 
Pp(J2n) > 1 — e/2 since, for this event to fail, it must fail in both the top and bottom 
halves of the box. It follows that if we place down a sequence of (possibly rotated and 
translated) boxes of sizes 2""*"^ x 2" anywhere, then with probability 1, all but finitely 
many are crossed. Finally, one can place these boxes down in an intelligent way such 
that crossing all but finitely many of them necessarily entails the existence of an infinite 
cluster (see Figure III.l). □ 




Figure III.l: 



5 Further discussion 

The Margulis-Russo formula is due independently to Margulis |Mar74j and Russo 
|Rus81] . 

The idea to use the results from KKL to show that pc = 1/2 is due to BoUobas 
and Riordan (see |BR06] ). It was understood much earlier that obtaining a sharp 
threshold was the key step. Kesten (see |Kes80j ) showed the necessary sharp threshold 
by obtaining a lower bound on the expected number of pivotals in a hands on fashion. 
Russo (see |Rus82j ) had developed an earlier weaker, more qualitative, version of KKL 
and showed how it also sufficed to show that pc = 1/2. 



Exercise sheet of Chapter |III 



Exercise III.l. Develop an alternative proof of the Margulis-Russo formula using 
classical couplings. 

Exercise III. 2. Study, as best as you can, what the "threshold windows" are (i.e. 
where and how long does it take to go from a probability of order e to a probability of 
order 1 — e) in the following examples: 

(a) for DICT„ 

(b) forMAJ„ 

(c) for the tribes example 

(d) for the iterated majority example. 

Do not rely on |KKL88j type of results, but instead do hands-on computations 
specific to each case. 



Exercise III. 3. Write out the details of the proof of Proposition III. 6 



Problem III. 4 {What is the "sharpest" monotone event ?). Show that among all 
monotone Boolean functions on MAJ„ is the one with largest total influence (at 
p=l/2). 

Hint: Use the Margulis-Russo formula. 



Exercise III. 5. A consequence of Problem III. 4 is that the total influence at p = 1/2 



of any monotone function is at most 0{^/n). A similar argument shows that for any p, 
there is a constant Cp so that the total influence at level p of any monotone function 
is at most Cp^fn. Prove nonetheless that there exists c > such for for any n, there 
exists a monotone function f = fn and a p = p„ so that the total influence of / at level 
p is at least cn. 

Exercise III. 6. Find a monotone function f : Qn ^ {0)1} such that d{E,p{f))/dp is 
very large at p = 1/2, but nevertheless there is no sharp threshold for / (this means 
that a large total influence at some value of p is not in general a sufficient condition for 
sharp threshold). 
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Chapter IV 



Fourier analysis of Boolean 
functions (first facts) 



1 Discrete Fourier analysis and the energy spec- 
trum 

It turns out that in order to understand and analyze the concepts previously introduced, 
which are in some sense purely probabilistic, a critical tool is Fourier analysis on the 
hypercube. 

Recall that we consider our Boolean functions as functions from the hypercube 
fin '■= { — 1) 1}"^ into { — 1, 1} or {0, 1} where Q„ is endowed with the uniform measure 

P = P" = (l5_i + l5i)®'^. 

In order to apply Fourier analysis, the natural setup is to enlarge our discrete space 
of Boolean functions and to consider instead the larger space L^({ — 1, 1}") of real- valued 
functions on Q„ endowed with the inner product: 

if^g) — XI 2""/(^i,---,2:n)^(a;i,---,a;n) 

= E[fg] for all e L^Q^) , 

where E denotes expectation with respect to the uniform measure P on 

For any subset S C {1, 2 . . . , n}, let X5 be the function on {—1,1}" defined for any 
x = {xi,...,Xn) by 

Xs(x):^l[xi. (IV.l) 

(So X9 = !•) It is straightforward (check this!) to see that this family of 2" functions 
forms an orthonormal basis of L^({ — 1, 1}"). Thus, any function / on (and a fortiori 
any Boolean function /) can be decomposed as 

SC{l,...,n} 
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where {f{S)}sc[n] are the so-called Fourier coefficients of /. They are also sometimes 
called the Fourier- Walsh coefficients of / and they satisfy 

f{S) := {f,Xs)=^fXs]. 

Note that /(0) is the average IE[/] and that we have Parseval's formula which states 
that 

E(f)= E /'(^)- 

5C{l,...,n} 

As in classical Fourier analysis, if / is some Boolean function, its Fourier (-Walsh) 
coefficients provide information on the "regularity" of /. We will sometimes use the 
term spectrum when referring to the set of Fourier coefficients. 

Of course one may find many other orthonormal bases for — 1, 1}""), but there are 
many situations for which this particular set of functions {xs}sc{i,...,n} arises naturally. 
First of all there is a well-known theory of Fourier analysis on groups, a theory which 
is particularly simple and elegant on Abelian groups (thus including our special case 
of {—1,1}", but also M/Z, M and so on). For Abelian groups, what turns out to be 
relevant for doing harmonic analysis is the set G of characters of G (i.e. the group 
homomorphisms from G to C*). In our case of G = {— 1, 1}", the characters are precisely 
our functions xs indexed by S' C {1, . . . ,n} since they satisfy Xs{x ■ y) = Xs{x)xs{y)- 
This background is not however needed and we won't talk in these terms. 

These functions also arise naturally if one performs simple random walk on the 
hypercube (equipped with the Hamming graph structure), since they are the eigen- 
functions of the corresponding Markov chain (heat kernel) on { — 1, 1}". Last but not 
least, we will see later in this chapter that the basis {xs} turns out to be particularly 
well adapted to our study of noise sensitivity. 

We introduce one more concept here without motivation; it will be very well moti- 
vated later on in the chapter. 

Definition IV. 1. For any real-valued function / : ri„ — )■ M, the energy spectrum Ef 

is defined by 

Ef{m):= fiS)', Vme{l,...,n}. 

\S\=m 

2 Examples 

First note that, from the Fourier point of view. Dictator and Parity have simple repre- 
sentations since they are xi and X[n] respectively. Each of the two corresponding energy 
spectra are trivially concentrated on 1 point, namely 1 and n. 

For Example [3} the Majority function, Bernasconi explicitly computed the Fourier 
coefficients and when n goes to infinity, one ends up with the following asymptotic 
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formula for the energy spectrum: 



|5|=m 



7rm2'" 



{Zj) + 0{m/n) if m is odd . 







if m is even . 



(The reader may think about why the "even" coefficients are 0.) See |O'D03j for a 
nice overview and references therein concerning the spectral behavior of the majority 
function. 



m 



1 3 5 



n 



Figure IV. 1: Shape of the energy spectrum for the Majority function 



Picture IV. 1 represents the shape of the energy spectrum of MAJ„: its spectrum 
is concentrated on low frequencies, which is typical of stable functions. 



3 Noise sensitivity and stability in terms of the en- 
ergy spectrum 

In this section, we describe the concepts of noise sensitivity and noise stability in terms 
of the energy spectrum. 

The first step is to note that, given any real- valued function / : fi„ — > M, the 
correlation between f{u) and fioj^) is nicely expressed in terms of the Fourier coefficients 
of / as follows: 

E[/H/K)] = E[(J]/(50x5.M)(E/(^2)X5.K))] 

Si 52 

= ^/(S)2e[x5Mx5(u;.)] 

5 
5 
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Moreover, we immediately obtain 

n 

Cov(/(a;), /(a;,)) = ^ Ef{m){l - e)™. (IV.3) 

m=l 

Note that either of the last two expressions tell us that Cov(/(ci;), /(we)) is nonneg- 
ative and decreasing in e. Also, we see that the "level of noise sensitivity" of a Boolean 
function is naturally encoded in its energy spectrum. It is now an an easy exercise to 
prove the following proposition. 

Proposition IV. 1 ( |BKS 99]). A sequence of Boolean functions fn '■ {—1,1}™" — )■ {0,1} 
is noise sensitive if and only if, for any k > 1, 

k k 



m=l |5|=m m=l 



Moreover, (I.l) holding does not depend on the value of e G (0, 1) chosen. 



There is a similar spectral description of noise stability which, given (IV.2), is an 
easy exercise. 

Proposition IV.2 ([BKS99]). A sequence of Boolean functions fn ■ {—1, 1}""" — {0, 1} 
is noise stable if and only if, for any e > 0, there exists k such that for all n, 

oo oo 

E E fn{Sr = J2Euim)<e. 

m=k \S\=m m=k 

So, as argued in the introduction, a function of "high frequency" will be sensitive 
to noise while a function of "low frequency" will be stable. 

4 Link between the spectrum and influence 

In this section, we relate the notion of influence with that of the spectrum. 
Proposition IV.3. // / : — i- {0, 1}, then for all k, 

uf) = 4 E /(^)' 



S:keS 

and 



i(/) = 4Ei^i/(^)' 
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Proof. If / : r2„ — 7- M, we introduce the functions 

Vfcf : < X , XX for all k e \n] , 

where ak acts on by flipping the k^^ bit (thus V^/ corresponds to a discrete derivative 
along the k^^ bit). 
Observe that 

V,/M= 5^ /(5)[X5M-X5KH)]= 5^ 2/(5) X5M, 

5C{l,...,n} SC{1,... ,n},fce5 

from which it follows that for any 5* C [n], 

Vr/(S) = { f (IV.4) 

Clearly, if / maps into {0,1}, then Ifc(/) := ||Vfc/||i and since V^/ takes values 
in { — 1,0,1} in this case, we have ||Va;/||i = ||Vfc/|||. Applying Parseval's formula to 



Vfc/ and using (IV.4), one obtains the first statement of the proposition. The second 



is obtained by summing over k and exchanging the order of summation. □ 

Remark IV. 1. If / maps into { — 1, 1} instead, then one can easily check that Ia;(/) = 
Es:kesfiSr ^ndlif) = Zs\S\fiS?- 

5 Monotone functions and their spectrum 

It turns out that for monotone functions, there is an alternative useful spectral descrip- 
tion of the infiuences. 

Proposition IV.4. // / : — )■ {0, 1} is monotone, then for all k 

Uf) = 2f{{k}) 

If f maps into { — 1, 1} instead, then one has that\k{f) = f{{k})- (Observe that Parity 
shows that the assumption of monotonicity is needed here; note also that the proof shows 
that the weaker result with = replaced by > holds in general.) 

Proof. We prove only the first statement; the second is proved in the same way. 

f{{k}) ■.= E[fx{k}] =E[fx{k}I{km] +^fX{k}I{keV}] 

It is easily seen that the first term is (independent of whether / is monotone or not) 

I ( f) 

and the second term is due to monotonicity. □ 
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Remark IV. 2. This tells us that, for monotone functions mapping into {—1,1}, the sum 



in Theorem |I.5| is exactly the total weight of the level 1 Fourier coefficients, that is, the 
energy spectrum at 1, Ef{l). (If we map into {0, 1} instead, there is simply an extra 
irrelevant factor of 4.) So Theorem 1.5 and Propositions IV. 1 and IV.4 imply that for 
monotone functions, if the energy spectrum at 1 goes to 0, then this is true for any 
fixed level. In addition. Propositions IV. 1 (with k = 1) and IV.4 easily imply that for 
monotone functions the converse of Theorem IL5I holds. 



Another application of Proposition |IV.4| gives a general upper bound for the total 
influence for monotone functions. 

Proposition IV. 5. // / : fi„ — ?■ {—1, 1} or {0, 1} is monotone, then 

I(/) < v^. 



Proof. If the image is {—1, 1}, then by Proposition IV.4, we have 



iif) = J2^k{f) = J2h{k}). 



k=l 



k=l 



By the Cauchy-Schwarz inequality, this is at most (X]fc=i /^({^}))^''^V^- By Parseval's 
formula, the ffist term is at most 1 and we are done. If the image is {0, 1}, the above 
proof can easily be modified or one can deduce it from the ffist case since the total 
influence of the corresponding ±1- valued function is the same. □ 

Remark IV. 3. The above result with some universal c on the right hand side follows 
(for odd n) from an earlier exercise showing that Majority has the largest influence 
together with the known influences for Majority. However, the above argument yields 
a more direct proof of the ^/n bound. 



Exercise sheet of chapter IIV 



Exercise IV. 1. Prove the discrete Poincare inequality, Theorem using the spec- 
trum. 

Exercise IV. 2. Compute the Fourier coefficients for the indicator function that there 
are all I's. 

Exercise IV. 3. Show that all even size Fourier coefficients for the Majority function 
are 0. Can you extend this result to a broader class of Boolean functions? 

Exercise IV. 4. For the Majority function MAJ„, find the limit (as the number of 
voters n goes to infinity) of the following quantity (total weight of the level-3 Fourier 
coefficients) 

^MAJ„(3):= Y,MAJn{Sf. 

|5|=3 

Exercise IV. 5. Let {/«} be a sequence of Boolean functions which is noise sensitive 
and {g-n} be a sequence of Boolean functions which is noise stable. Show that {/n} and 
{gn} are asymptotically uncorrelated. 

Exercise IV. 6 (Another equivalent definition of noise sensitivity). Assume that {An} 
is a noise sensitive sequence. (This of course means that the indicator functions of these 
events is a noise sensitive sequence.) 

(a) Show for each e > 0, we have that PftUe G A^, | — P[A„] approaches in 
probability. 

Hint: use the Fourier representation. 

(b) Can you show the above implication without using the Fourier representation? 

(c) Discuss if this implication is surprising. 

(d) Show that the condition in part (a) implies that the sequence is noise sensitive 
directly without the Fourier representation. 

Exercise IV. 7. How does the spectrum of a generic Boolean function look? Use this 
to give an alternative answer to the question asked in problem 1.14 of Chapter |T] 
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Exercise IV. 8. (Open exercise). For Boolean functions, can one have ANY (reason- 
able) shape of the energy spectrum or are there restrictions? 

For the next exercises, we introduce the following functional which measures the 
stability of Boolean functions. For any Boolean function / : f2„ — )■ { — 1, 1}, let 

: 6H^P[/M^/(u;,)]. 
Obviously, the smaller is, the more stable / is. 

Exercise IV. 9. Express the functional S/ in terms of the Fourier expansion of /. 

By a balanced Boolean function, we mean one which takes its two possible values 
each with probability 1/2. 

Exercise IV. 10. Among balanced Boolean functions, does there exist some function 
/* which is "stablest" in the sense that for any balanced Boolean function / and any 

e > 0, 

If yes, describe the set of these extremal functions and prove that these are the only 
ones. 

Problem IV. 11. In this problem, we wish to understand the asymptotic shape of the 
energy spectrum for MAJ„. 

(a) Show that for all e > 0, 

1 arcsinfl — e) arccosfl — e) 

hm §MAJ„(e) 
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Hint: The relevant limit is easily expressed as the probability that a certain 2- 
dimensional Gaussian variable (with a particular correlation structure) falls in a 
certain area of the plane. One can write down the corresponding density function 
and this probability as an explicit integral but this integral does not seem so easy 
to evaluate. However, this Gaussian probability can be computed directly by 
representing the joint distribution in terms of two independent Gaussians. 

Note that the above limit immediately implies that for /„ = MAJ„, 

1- Ttr/x / \ i- / \N 2arcsin('l - e) 
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(b) Deduce from (a) and the Taylor expansion for arcsin(a;) the limiting value, as 
n — )■ oo of EMAJ„{k) = J2\s\=k'^-^'^ n{SY for all k > 1. Check that the answer 
is consistent with the values obtained earlier for A; = 1 and k = 3 (Exercise IV.4). 



Chapter V 

Hypercontractivity and its 
applications 



In this lecture, we will prove the main theorems about influences stated in Chapter [1} 
As we will see, these proofs rely on techniques imported from harmonic analysis, in 



particular hypercontractivity. As we will see later in this chapter and in Chapter |VII 
these types of proofs extend to other contexts which will be of interest to us: noise 
sensitivity and sub-Gaussian fluctuations. 



1 Heuristics of proofs 

All the subsequent proofs which will be based on hypercontractivity will have more or 



less the same flavor. Let us now explain in the particular case of Theorem |I.2| what the 
overall scheme of the proof is. 

Recall that we want to prove that there exists a universal constant c > such that 
for any function / : i7„ — t- {0, 1}, one of its variables has influence at least g ^°g"^'^''(.^) _ 

Let / be a Boolean function. Suppose all its influences are "small" (this would 
need to be made quantitative). This means that V^/ must have small support. Using 
the intuition coming from the Weyl-Heisenberg uncertainty, V^/ should then be quite 
spread out in the sense that most of its spectral mass should be concentrated on high 
frequencies. 

This intuition, which is still vague at this point, says that having small influences 
pushes the spectrum of Vkf towards high frequencies. Now, summing up as we did in 
Section |4] of Chapter [TV] but restricting ourselves only to frequencies S of size smaller 
than some large (well-chosen) 1 ^ M <^ n, one easily obtains 
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0<|5|<A/ 0<|5|<Af 

k 0<|S|<A/ 
k 

= (v.i) 

where, in the third hne, we used the informal statement that V^/ should be supported 
on high frequencies if / has small influences. Now recall (or observe) that 

f{Sr = Var(/) . 

\s\>o 



E 



Therefore, in the above equation (V.I), if we are in the case where a positive fraction 



of the Fourier mass of / is concentrated below M, then (V.I) says that I(/) is much 
larger than Var(/). In particular, at least one of the influences has to be "large". If, on 
the other hand, we are in the case where most of the spectral mass of / is supported 
on frequencies of size higher than M, then we also obtain that I(/) is large by using 
the formula: 

I(/)=4$^|S|/(5)^ 

s 

Remark V.I. Note that these heuristics suggest that there is a subtle balance between 
^^Ifc(/) = I(/) and supfclfc(/). Namely, if influences are all small (i.e. || • ||oo is small), 
then their sum on the other hand has to be "large". The right balance is exactly 



quantified by Theorem 1.3 



Of course it now remains to convert the above sketch into a proof. The main diffi- 
culty in the above program is to obtain quantitative spectral information on functions 
with values in {—1, 0, 1} knowing that they have small support. This is done ( |KKL88] ) 
using techniques imported from harmonic analysis, namely hypercontractivity. 



2 About hypercontractivity 

First, let us state what hypercontractivity corresponds to. Let {Kt)t>o be the heat kernel 
on M". Hypercontractivity is a statement which quantifies how functions are regularized 
under the heat fiow. The statement, which goes back to a number of authors, can be 
simply stated as follows: 

Theorem V.I (Hypercontractivity). Consider K" with standard Gaussian measure. If 
1 < q < 2, there is some t = t{q) > (which does not depend on the dimension n) such 
that for any f G L«(M"), 

ll^**/l|2< 11/11.. 
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The dependence t = t{q) is explicit but will not concern us in the Gaussian case. 
Hypercontractivity is thus a regularization statement: if one starts with some initial 
"rough" L'^ function / outside of and waits long enough {t{q)) under the heat flow, 
then we end up being in with a good control on its norm. We will not prove nor 
use Theorem IV. 1[ 

This concept has an interesting history as is nicely explained in O'Donnell's lecture 
notes (see |Q'D] ). It was originally invented by Nelson in |Nel66j where he needed 
regularization estimates on Free fields (which are the building blocks of quantum field 
theory) in order to apply these in "constructive field theory" . It was then generalized by 
Gross in his elaboration of logarithmic Sobolev inequalities ( |Gro75j ). which is an im- 
portant tool in analysis. Hypercontractivity is intimately related to these Log-Sobolev 
inequalities and thus has many applications in the theory of Semigroups, mixing of 
Markov chains and other topics. 

We now state the result in the case which concerns us, namely the hypercube. For 
any p G [0,1], let Tp be the following noise operator on the set of functions on 
the hypercube: recall from Chapter |T] that if G we denote by cOe an e-noised 
configuration of oj. For any / : f2„ — )■ M, we define Tpf : u i— )■ K[f{ui-p) | u~\. This 
noise operator acts in a very simple way on the Fourier coefficients, as the reader can 
chc ck ' 



We have the following analogue of Theorem V.l 
Theorem V.2 (Bonami-Gross-Beckner). For any / : f2„ — )■ M and any p G [0, 1], 

||T,/||2<||/||l+,2. 



The analogy with the classical result V.l is clear: the heat flow is replaced here 



by the random walk on the hypercube. You can find the proof of Theorem |V.2 in the 
appendix attached to the present chapter. 

Remark V.2. The term hypercontractive refers here to the fact that one has an operator 
which maps L'^ into (g < 2), which is a contraction. 



Before going into the detailed proof of Theorem 1.2, let us see why Theorem V.2 



provides us with the type of spectral information we need. In the above sketch, we 
assumed that all infiuences were small. This can be written as 

Ifc(/) = ||V,/||i = ||Vfe/||^«l, 

for any k E [n]. Now if one applies the hypercontractive estimate to these functions 
Vfc/ for some fixed < p < 1, we obtain that 

\\Tp{V,f)h < ||V,/||i+p2 = ||V,/||f « ||V,/||2 (V.2) 
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where, for the equahty, we used once again that V^/ G { — 1, 0, 1}. After squaring, this 
gives on the Fourier side. 



S 



This shows (under the assumption that Ife(/) is small) that the spectrum of V^/ is 
indeed mostly concentrated on high frequencies. 



Remark V.3. We point out that Theorem V.2 in fact tells us that any function with 
small support has its frequencies concentrated on large sets as follows. It is easy to see 
that given any p < 2, if a function /i on a probability space has very small support. 



then its Lp norm is much smaller than its L2 norm. Using Theorem V.2 we would then 
have for such a function that 



IT. 



< ||/i||i+p2 ^ \\h\ 



yielding that 

s s 

which can only occur if h has its frequencies concentrated on large sets. From this point 
of view, one also sees that under the small influence assumption, one did not actually 



need the third term in (V.2) in the above outline 



3 Proof of the KKL Theorems on the influences of 
Boolean functions 



We will start by proving Theorem 1.2, and then Theorem 1.3 In fact, it turns out that 



one can recover Theorem 1.2 directly from Theorem 1.3; see the exercises. Nevertheless, 
since the proof of Theorem |I.2|is slightly simpler, we start with this one. 



3.1 Proof of Theorem O 



Let / : ^ {0, !}• Recall that we want to show that there is some k G [n] such that 

logn 



Ia:(/) >cVar(/)- 



n 



(V.3) 



for some universal constant c > 0. 

We divide the analysis into the following two cases. 
Case 1: 

Suppose that there is some k G [n] such that Ikif) > n"^/*^ Var(/). Then the bound 



V.3| is clearly satisfied for a small enough c > 0. 
Case 2: 
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Now, if / does not belong to the first case, this means that for all k G [n] 



Ifc(/) = ||Vfc/||^<Var(/)n 



-3/4 



(V.4) 



Following the above heuristics, we will show that under this assumption, most of 
the Fourier spectrum of / is supported on high frequencies. Let M > 1, whose value 
will be chosen later. We wish to bound from above the bottom part (up to M) of the 
Fourier spectrum of /. 



1<\S\<M 1<|S'|<M 

< 2^^^ 5^ (1/2)21^1 
|s|>i 



i2^^-^5^||ri/2(v./)|| 



2 

2 5 



see SectionHof Chapter IV). Now by applying hypercontractivity (Theorem V.2) with 



p = 1/2 to the above sum, we obtain 



1<|5|<A/ 



k 

< 22^^5^I,(/)«/^ 

k 



where we used the assumption V.4 and the obvious fact that Var(/)^/^ < Var(/) (recall 



Var(/) < 1 since / is Boolean). Now with M := log2 n\ , this gives 



J2 /('^)' ^ Var(/) = ^-1/^° Var(/) 



l<|S|<^log2n 



This shows that under our above assumption, most of the Fourier spectrum is con- 
centrated above ^(logn). We are now ready to conclude: 
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k n n 

> 



n 

\S\>M 
\S\>M 

= ^[Var(/)- E nSf] 

1<\S\<M 

> ^Var(/)[l-n-i/i°] 



with ci = 2oiog2 ('^ ~ 2^^/^°). By combining with the constant given in case 1, this 
completes the proof. □ 

Remark V.4. We did not try here to optimize the proof in order to find the best possible 
universal constant c > 0. Note though, that even without optimizing at all, the constant 
we obtain is not that bad. 



3.2 Proof of Theorem |L3 



We now proceed to the proof of the stronger result. Theorem |I.3[ which states that 
there is a universal constant c > such that for any / : f2„ — )■ {0, 1}, 

||I(/)l| = ||Inf(/)||i>cVar(/)log 



|Inf(/)||. 



The strategy is very similar. Let / : — )■ {0,1} and let 5 := ||Inf(/)||oo = 
sup^Ifc(/). Assume for the moment that 5 < 1/1000. As in the above proof, we start 
by bounding the bottom part of the spectrum up to some integer M (whose value will 
be fixed later). Exactly in the same way as above, one has 



1<\S\<M k 
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Now, 



Var(/) = J2 fiS? < E /(^)' + ]^ E 



\s\>i 



l<|5|<Af 



\S\>M 



< r-^^^/^ + ^] I(/) . 



Choose M := j|log2(|) — ^ logg log2(|). Since 6 < 1/1000, it is easy to check that 
M > log2(l/5) which leads us to 



Var(/) < 



+ 



10 



l0g2(l/5) l0g2(l/5)J 



(V.5) 



which gives 



I(/) = ||Inf(/)|K> 



1 



Var(/) log 



1 



lllog2 ----"^ ||Inf(/)|U' 

This gives us the result for 6 < 1/1000. 

Next the discrete Poincare inequality, which says that I(/) > Var(/), tells us that 
the claim is true for 6 > 1/1000 if we take c to be 1/log 1000. Since this is larger than 

□ 



-I , / „ , we obtain the result with the constant c 

1 1 log 2 ' 



1 



11 log 2' 



4 KKL away from the uniform measure 



In Chapter III (on sharp thresholds), we needed an extension of the above KKL Theo- 
rems to the p-biased measures Pp = {p6i + (1 — These extensions are respec- 



tively Theorems III. 2 and III. 3 



A first natural idea in order to extend the above proofs would be to extend the 
hypercontractive estimate (Theorem V.2 ) to these p-biased measures Pp. This extension 



of Bonami-Gross-Beckner is possible, but it turns out that the control it gives gets worse 
near the edges {p close to or 1). This is problematic since both in Theorems III. 2 and 



III. 3, we need bounds which are uniform in p G [0, 1]. 

Hence, one needs a different approach to extend the KKL Theorems. A nice ap- 
proach was provided in |BKK+92J . where they prove the following general theorem. 



Theorem V.3 ( [BKK+92j ). There exists a universal c > such that for any measurable 
function f : [0, 1]" — )■ {0, 1}, there exists a variable k such that 



Uf) > cVar(/) 



logn 



n 



Here the 'continuous' hypercube is endowed with the uniform (Lebesgue) measure and 
for any k E [n], Ifc(/) denotes the probability that f is not almost-surely constant on 
the fiber given by {xi)i^k- 
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In other words, 



!,(/) = P[Var(/( 



^ A;) > 0] . 



It is clear how to obtain Theorem III. 2 from the above theorem. If p G [0, 1] and 
/ : fi„ ^ {0, 1}, consider fp : [0, 1]" ^ {0, 1} defined by 



/((I 



Friedgut noticed in |Fri04j that one can recover Theorem V.3 from Theorem III. 2 



The first idea is to use a symmetrization argument in such a way that the problem 
reduces to the case of monotone functions. Then, the main idea is the approximate the 
uniform measure on [0, 1] by the dyadic random variable 



Xm '■ (xi, 



M 



.XM)G{-l,ir^E^2- 



m=l 



One can then approximate / : [0, 1]' 



fM{x\, 



— 7- {0, 1} by the Boolean function Jm defined on 



1 5 



fi^M^ ■ ■ ■ ^^m) 



Still (as mentioned in the above heuristics) this proof requires two technical steps: a 
monotonization procedure and an "approximation" step (going from / to /a/). Since in 
our applications to sharp thresholds we used Theorems [III. 2 and III. 3 only in the case 
of monotone functions, for the sake of simplicity we will not present the monotonization 
procedure in these notes. 

Furthermore, it turns out that for our specific needs (the applications in Chapter 
III), we do not need to deal with the approximation part either. The reason is that for 
any Boolean function /, the function p i— I|(/) is continuous. Hence it is enough to 
obtain uniform bounds on Ifc(/) for dyadic values of p (i.e. p G {m2~^^} fl [0, 1]). 



See the exercises for the proof of Theorems III. 2 and III. 3 when / is assumed to be 



monotone (problem V.4). 



Remark V.5. We mentioned above that generalizing hypercontractivity would not allow 
us to obtain uniform bounds (with p taking any value in [0, 1]) on the influences. It 
should be noted though that Talagrand obtained ( |Tal94] ) results similar to Theorems 



III. 2 and III. 3 by somehow generalizing hypercontractivity, but along a different line. 
Finally, let us point out that both Talagrand ( |Tal94j ) and Friedgut and Kalai ( |FK96j ) 
obtain sharper versions of Theorems III. 2 and III. 3 where the constant c = Cp in fact 
improves (i.e. blows up) near the edges. 
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5 The noise sensitivity theorem 



In this section, we prove the milestone Theorem 1.5 from jBKS99j . Before recalhng 
what the statement is, let us define the following functional on Boolean functions. For 
any f : VL^ {0, 1}, let 

H(/):=$^I.(/)^=||Inf(/)i. 

k 

Recall the Benjamini-Kalai-Schramm Theorem. 

Theorem V.4 ( |BKS99] ). Consider a sequence of Boolean functions fn '■ — ^ {0, 1}. 
If 

m„ 

H{f^) = j2ufr^o 

k=l 

as n ^ oo, then {/„}„ is noise sensitive. 

We will in fact prove this theorem under a stronger condition, namely that H(/„) < 
(m„)~'^ for some exponent 6 > 0. Without this assumption of "polynomial decay" on 
H(/„), the proof is more technical and relies on estimates obtained by Talagrand. See 
the remark at the end of this proof. For our application to the noise sensitivity of 



percolation (see Chapter VI), this stronger assumption will be satisfied and hence we 
stick to this simpler case in these notes. 

The assumption of poljTiomial decay in fact enables us to prove the following more 
quantitative result. 

Proposition V.5 ( |BKS99j ). For any S > 0, there exists a constant M = M{S) > 
such that if fn '■ {0, 1} is any sequence of Boolean functions satisfying 

H(/„) < (m„)-^ 

then 

E fniSf^O. 

l<|5|<Mlog (m„) 



Using Proposition IV. 1, this proposition obviously implies Theorem 1.5 when H(/„) 
decays as assumed. Furthermore, this gives a quantitative "logarithmic" control on the 
noise sensitivity of such functions. 

Proof. The strategy will be very similar to the one used in the KKL Theorems (even 
though the goal is very different). The main difference here is that the regularization 
term p used in the hypercontractive estimate must be chosen in a more delicate way 
than in the proofs of KKL results (where we simply took p = 1/2). 
Let M > be a constant whose value will be chosen later. 
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l<|5|<Mlog(m„) l<|S|<Aflog{m„) k l<|5'|<Af log{m„) 



k ^ 



k p 



by Theorem V.2 



Now, since /„ is Boolean, one has ||Vfc/n||i+p2 = || Vfc/„||2''^^^^ hence 

0<|S'|<Mlog{7Tt„) k k 

< p-^^'°^("^")(m„)''^/(^+''^)(5^I,(/„)2)^ (byH51der) 

k 

< p-2Mlog(m„)(^^)^ 



Now by choosing p G (0, 1) close enough to 0, and then by choosing M = M{6) 
small enough, we obtain the desired logarithmic noise sensitivity. □ 

We now give some indications of the proof of Theorem |I.5| in the general case. 

Recall that Theorem 1.5 is true independently of the speed of convergence of H(/„) = 
Ylk ^kifnY- The proof of this general result is a bit more involved than the one we gave 
here. The main lemma is as follows: 

Lemma V.6 ( |BKS99] ). There exist absolute constants Ck such that for any monotone 
Boolean function f and for any k > 2, one has 



$^/W<C.H(/)(-logH(/)) 

\S\=k 



k-l 



This lemma "mimics" a result from Talagrand's |Tal96] . Indeed, Proposition 2.3 in 
|Tal96] can be translated as follows: for any monotone Boolean function /, its le yel-2 



Fourier weight (i.e. ^|5|=2 /("S*)^) is bounded by 0(1)H(/) log(l/H(/)). Lemma V.6 
obviously implies Theorem |I.5| in the monotone case, while the general case can be de- 
duced by a monotonization procedure. It is worth pointing out that hypercontractivity 
is used in the proof of this lemma. 



Appendix: proof of 
hypercontractivity 



The purpose of this appendix is to show that we are not using a giant "hammer" but 
rather that this needed inequahty arising from Fourier analysis is understandable from 
first principles. In fact, historically, the proof by Gross of the Gaussian case first looked 
at the case of the hypercube and so we have the tools to obtain the Gaussian case should 
we want to. Before starting the proof, observe that for p = (where 0° is defined to be 
1), this simply reduces to | / / | < / |/|. 



Proof of Theorem [¥721 



0.1 Tensorization 

In this first subsection, we show that it is sufficient, via a tensorization procedure, that 
the result holds for n = 1 in order for us to be able to conclude by induction the result 
for all n. 

The key step of the argument is the following lemma. 

Lemma V.7. Let q > p > I, (fii, /ii), (^^2) A^2) be two finite probability spaces, Ki : 
X — )• M and assume that for i = 1,2 

\\Ti{f)\\Lgin,,fj.,) < llfWipin^fj.,) 
where Ti{f){x) := J^^ f{y)Ki{x,y)dfXi{y). Then 

11^1 (g) T2(/)||L,((ni,/xi)x(Q2,M2)) < ||/IUp((ni,Mi)x(n2,M2)) 

where Ti ® T2(/)(xi, X2) := J^^^^^ f{yi,y2)Ki{xi,yi)K2{x2,y2)diJ.i{yi) x rfp2(y2)- 

Proof. One first needs to recall Minkowski's inequality for integrals, which states that, 
for g > and r G [1, 00), we have 

g{x,y)diy{y)] rfp(x)^ < / f / g{x,yydiJ,{x)] dv{y). 
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(Note that when u consists of 2 point masses each of size 1, then this reduces to the 
usual Minkowski inequahty.) 

One can think of Ti acting on functions of both variables by leaving the second 
variable untouched and analogously for T2. It is then easy to check that Ti®T2 = T10T2. 
By thinking of fixed, our assumption on Ti yields 



< 



^2 \Jn 



iT2(/)rrf/ii(xi 



(It might be helpful here to think of T2(/)(xi, function g^^{xi) where X2 is 

fixed) . 

Applying Minkowski's integral inequality to \T2{f)\^ with r = q/p, this in turn is at 
most 

p/q 



|T2(/)|''c//i2(x2) dfii{Xi) 



^2 



1 g/p 



Fixing now the Xi variable and applying our assumption on T2 gives that this is at most 
llfllr i/n \^fn xx, as desired. □ 

II'' IILp((SZi,/Ji)x(SZ2,M2))' 

The next key observation, easily obtained by expanding and interchanging of sum- 
mation, is that our operator Tp acting on functions on fi„ corresponds to an operator 
of the type dealt with in the previous lemma with K{x, y) being 

SC{l,...,n} 

In addition, it is easily checked that the function K for the Qn is simply an n-fold 
product of the function for the n = 1 case. 

Assuming the result for the case n = 1, Lemma V.7 and the above observations 
allow us to conclude by induction the result for all n. 



0.2 The n = 1 case 

We now establish the case n = 1. We abbreviate Tp by T. 

Since fix) = (/(-I) + /(l))/2 + (/(I) - /(-l))/2 x, we have Tfix) = (/(-I) + 
/(l))/2 + p(/(l) - /(-l))/2 X. Denoting (/(-I) + /(l))/2 by a and (/(I) - /(-l))/2 
by b, it suffices to show that for all a and b, we have 

(,2 + < \a + br^' + \a-b\^^^\ 

Using p G [0, 1], the case a = is immediate. For the 7^ 0, it is clear we can 

assume a > 0. Dividing both sides by a^"*"^ , we need to show that 

(1 + < n+^i'^'-' + n-^r^' (V.6) 
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for all y and clearly it suffices to assume y > 0. 

We ffist do the case that y G [0, 1). By the generalized Binomial formula, the right 



hand side of (V.6) is 



.k=0 



1+ p' 
k 



fc=0 



1 + p' 
k 



k=0 



1 + p' 

2k 



,2k 



For the left hand side of (|V.6|), we ffist note the following. For < A < 1, a simple 

(1 + - 



calculation shows that the function g{x] 
on [0, oo) and hence g{x) < on [0, oo). 



Xx has a negative derivative 



This yields that the left hand side of (V.6) is at most 

p y 



which is precisely the ffist two terms of the right hand side of (V.6). On the other 



hand, the binomial coefficients appearing in the other terms are nonnegative, since in 
the numerator there are an even number of terms with the ffist two terms being positive 
and all the other terms being negative. This verifies the desired inequality for y G [0, 1). 
The case y = 1 for (V.6) follows by continuity. 

For y > 1, we let z = 1/ y and note, by multiplying both sides of (V.6) by , we 
need to show 



(^2^p2)(lV)/2< 

Now, expanding (1 — — / 



l + z 



+ 1 



(V.7) 



)^), one sees that 2;^ + < 1 + z'^p^ and hence the 
desired inequality follows precisely from (V.6) for the case y G (0,1) already proved. 
This completes the n = 1 case and thereby the proof. □ 
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Exercise sheet of Chapter |V 



Exercise V.l. Find a direct proof that Theorem |I.3| implies Theorem 1.2 



Exercise V.2. Is it true that the smaller the influences are, the more noise sensitive 
the function is? 



Exercise V.3. Prove that Theorem V.3 indeed implies Theorem III.2 
Hint: use the natural projection. 



Problem V.4. In this problem, we prove Theorems |III.2| and |III.3| for the monotone 
case. 



1. Show that Theorem TTL3] implies III. 2 and hence one needs to prove only Theorem 
III. 3 (This is the basically the same as Exercise V.l[ ). 



2. Show that it suffices to prove the result when p = for integers k and i. 

3. Let n : {0, 1}^ -> {0, l/2^ . . . , (2^ - l)/2^} by U{xi, . . . , x,) = Eti Ob- 
serve that if x is uniform, then n(x) is uniform on its range and that P(n(a:;) > 



XI 



4. Define g : {0, 1}^ {0, 1} by g{xi, ...,Xi) := /{n(x)>i-p}- Note that F{g{ 
l)=p. 

5. Define / : {0, 1}"^ ^ {0, 1} by 

/(X]^, . . . , X£, X]^, . . . , x^, . . . , , . . . , X£ ) = fi^gix^^ . . . ,Xf,), g{x^, . . . , x^), . . . , g^x^ , . . . , x^ )). 

Observe that / (defined on ({0, 1}"^, 711/2)) and / (defined on ({0, 1}", vr^)) have 
the same distribution and hence the same variance. 

6. Show (or observe) that I(rj)(/) < I^(/) for each r = 1, . . . ,n and j = 
Deduce from Theorem 11.31 that 



J]l(.,,)(/)>cVar(/)log(l/5p) 



where 5p := maxil^(/) where c comes from Theorem 1.3 
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7. (Key step). Show that for each r — 1, . . . ,n and j — 1, . . . ,i, 

8. Combine parts 6 and 7 to complete the proof. 



Chapter VI 



First evidence of noise sensitivity of 
percolation 



In this lecture, our goal is to collect some of the facts and theorems we have seen so far 
in order to conclude that percolation crossings are indeed noise sensitive. Recall from 



the "BKS" Theorem (Theorem 1.5) that it is enough for this purpose to prove that 
influences are "small" in the sense that '^/.'i-kifnY goes to zero. 

In the first section, we will deal with a careful study of influences in the case of 
percolation crossings on the triangular lattice. Then, we will treat the case of Z^, 
where conformal invariance is not known. Finally, we will speculate to what "extent" 
percolation is noise sensitive. 

This whole chapter should be considered somewhat of a "pause" in our program, 
where we take the time to summarize what we have achieved so far in our understanding 
of the noise sensitivity of percolation, and what remains to be done if one wishes to prove 
things such as the existence of exceptional times in dynamical percolation. 



1 Bounds on influences for crossing events in criti- 
cal percolation on the triangular lattice 

1.1 Setup 

Fix a,b > 0, let us consider some rectangle [0, a ■ n] x [0, 6 ■ n], and let i?„ be the set 
of of hexagons in T which intersect [0, a ■ n] x [0, 6 • n]. Let /„ be the event that there 
is a left to right crossing event in (This is the same event as in Example [7| in 
chapter [l| but with replaced by T). By the RSW Theorem II. 1, we know that {fn} 



is non-degenerate. Conformal invariance tells us that E[/„] = P[/„ = l] converges as 
n — > oo. The limit is given by the so-called Cardy's formula. 

In order to prove that this sequence of Boolean functions {/„} is noise sensitive. 
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we wish to study its influence vector Inf(/„) and we would like to prove that H(/„) = 
||Inf(/„)||2 = decays polynomially fast towards 0. (Recall that in these notes, 

we gave a complete proof of Theorem 1.5 only in the case where H(/„) decreases as an 
inverse polynomial of the number of variables.) 



1.2 Study of the set of influences 

Let a: be a site (i.e. a hexagon) in the rectangle Rn- One needs to understand 

^xifn) := P[a; is pivotal for /„] 



It is easy but crucial to note that if x is 
at distance d from the boundary of Rn, in 
order for x to be pivotal, the four-arm event 
described in Chapter [ll] (see Figure II. 2) has 
to be satisfied in the ball B{x, d) of radius 
d around the hexagon x. See the figure on 
the right. 




In particular, this implies (still under the assumption that dist(x, (9-R„) 

Ufn)<a,{d) = d--^+"'^'\ 



d) that 



where a^^d) denotes the probability of the four-arm event up to distance d. See Chapter 
im The statement 

implies that for any e > 0, there exists a constant C = C^, such that for all R> 1, 



0^4 



(R) < CR'^l^+\ 



The above bound gives us a very good control on the influences of the points in the 
hulk of the domain (i.e. the points far from the boundary). Indeed, for any fixed 5 > 0, 
let be the set of hexagons in R^ which are at distance at least 5n from dRn- Most 
of the points in R^ (except a proportion 0(5) of these) lie in A^, and for any such point 
X G A^, one has by the above argument 



(VI. 1) 



Therefore, the contribution of these points to H(/„) = Ylik'^kUn)'^ is bounded by 
0(n^)(C(5~^/^n~^/^+'')^ = ©(^'^/^n"-^/^"^^"). As n — )■ oo, this goes to zero polynomially 
fast. Since this estimate concerns "almost" all points in it seems we are close to 
proving the BKS criterion. 
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1.3 Influence of the boundary 

Still, in order to complete the above analysis, one has to estimate what the influence 
of the points near the boundary is. The main difficulty here is that if x is close to 
the boundary, the probability for x to be pivotal is not related anymore to the above 
four-arm event. Think of the above figure when d gets very small compared to n. One 
has to distinguish two cases: 

• X is close to a corner. This will correspond to a two- arm event in a quarter-plane. 

• X is close to an edge. This involves the three-arm event in the half-plane H. 

Before detailing how to estimate the infiuence of points near the boundary, let us 
start by giving the necessary background on the involved critical exponents. 

The two-arm and three-arm events in EI . For these particular events, it turns out 
that the critical exponents are known to be universal: they are two of the very few crit- 
ical exponents which are known also on the square lattice Z^. The derivations of these 
types of exponents do not rely on SLE technology but are "elementary". Therefore, in 
this discussion, we will consider both lattices T and Z^. 

The three- arm event in EI corre- 
sponds to the event that there are 
three arms (two open arms and 
one 'closed' arm in the dual) go- 
ing from to distance R and such 
that they remain in the upper half- 
plane. See the figure for a self- 
explanatory definition. The two- 
arm event corresponds to just hav- 
ing one open and one closed arm. 

Let (R) and (y^{R) denote the probabilities of these events. As in chapter |ll| let 
a^(r, R) and a^{r, R) be the natural extensions to the annulus case (i.e. the probability 
that these events are satisfied in the annulus between radii r and R in the upper half- 
plane). 

We will rely on the following result, which goes back as far as we know to M. 
Aizenman. See |Wer07j for a proof of this result. 

Proposition VI. 1. Both on the triangular lattice T and on 1? , one has that 

at{r,R)-{r/R) 

and 

a+(r,i?)x(r/i?)2. 

Note that, in these special cases, there are no o(l) correction terms in the exponent. 
The probabilities are in this case known up to constants. 
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The two- arm event in the quarter-plane. In this case, the corresponding exponent 
is unfortunately not known on Z^, so we will need to do some work here in the next 
section, where we will prove noise sensitivity of percolation crossings on Z^. 

The two- arm event in a corner corresponds to 
the event illustrated on the following picture. 
We will use the following proposition: 

Proposition VI.2 r |SW01] ). // at^{R) de- 
notes the probability of this event, then 

at^{R) = R-^+°^^\ 

and with the obvious notations 

a++{r,R) = (r/i?)2+°(i) . 

Now, back to our study of influences, we are in good shape (at least for the triangular 
lattice) since the two critical exponents arising from the boundary effects are larger than 
the bulk exponent 5/4. This means that it is less likely for a point near the boundary 
to be pivotal than for a point in the bulk. Therefore in some sense the boundary helps 
us here. 

More formally, summarizing the above facts, for any e > 0, there is a constant 
C = C{e) such that for any 1 < r < i?, 

max{a4(r,/2),a^(r,i?),a++(r,i?)} < C{r/R)'^-\ (VI.2) 

Now, if X is some hexagon in let uq be the distance to the closest edge of dRn 
and let Xq be the point on dRn such that dist(a;,Xo) = Uq. Next, let ni > Uq be the 
distance from xq to the closest corner and let xi be this closest corner. It is easy to see 
that for X to be pivotal for /„, the following events all have to be satisfied: 

• The four-arm event in the ball of radius no around x. 

• The H-three-arm event in the annulus centered at xo of radii 2no and ni. 

• The corner-two-arm event in the annulus centered at xi of radii 2ni and n. 

By independence on disjoint sets, one thus concludes that 

Ix(/n) < 04(^0) a^(2no,ni) a^+(2ni,n) 
< 0(l)n-^/^+^ 
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1.4 Noise sensitivity of crossing events 

This uniform bound on the influences over the whole domain i?„ enables us to conclude 
that the BKS criterion is indeed verified. Indeed, 

H(/„) = Yl Ufnf < Cn\n-"'^^f = Cn-'/'^'^ , (VI.3) 

where C = C{a, b, e) is a universal constant. By taking e < 1/4, this gives us the desired 
polynomial decay on H(/„), which by Proposition V.5) implies 

Theorem VI.3 ( |BKS99] ). The sequence of percolation crossing events {/„} on T is 
noise sensitive. 

We will give some other consequences (for example, to sharp thresholds) of the 
above analysis on the influences of the crossing events in a later section. 



2 The case of percolation 

Let Rn denote similarly the rectangle closest to [0, a - n] x [0, & ■ and let be the 
corresponding left-right crossing event (so here this corresponds exactly to example [?]). 
Here one has to face two main difficulties: 

• The main one is that due to the missing ingredient of conformal invariance, one 
does not have at our disposal the value of the four-arm critical exponent (which 
is of course believed to be 5/4). In fact, even the existence of a critical exponent 
is an open problem. 

• The second difficulty (also due to the lack of conformal invariance) is that it is 
now slightly harder to deal with boundary issues. Indeed, one can still use the 
above bounds on which are universal, but the exponent 2 for a^'*' is not known 
for Z^. So this requires some more analysis. 

Let us start by taking care of the boundary effects. 



2.1 Handling the boundary effect 

What we need to do in order to carry through the above analysis for is to ob- 
tain a reasonable estimate on a^"^. Fortunately, the following bound, which follows 



immediately from Proposition VI. 1 is sufficient. 



a++(r,i?)<0(l)^. (VI.4) 

Now let e be an edge in We wish to bound from above Ie(/n)- We will use 
the same notation as in the case of the triangular lattice: recall the definitions of 
no, xq, ni,xi there. 
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We obtain in the same way 

Ie(/n) < Oi^ino) ^3 (2no, ni) a^+(2ni, n) . 



(VI.5) 



At this point, we need another universal exponent, which goes back also to M. 
Aizenman: 

Theorem VI. 4 (M. Aizenman, see |Wer07] ). Let a5{r,R) denote the probability that 
there are 5 arms (with four of them being of 'alternate colors'). Then there are some 
universal constants c,C > such that both for T and 1? , one has for all 1 < r < R, 

c{^y<a,{r,R)<cQ\ 

This resuh allows us to get a lower bound on 04 (r, R). Indeed, it is clear that 

tt4(r, R) > a^ir, R) > n{l)a^{r, R) . (VI.6) 

In fact, one can obtain the following better lower bound on 04 (r, R) which we will 
need later. 

Lemma VI.5. There exists some e > and some constant c > such that for any 
l<r <R, 

a^{r,R) > c{r/Rf-'. 

Proof. There are several ways to see why this holds, none of them being either very 
hard or very easy. One of them is to use Reimer's inequality (see |Gri99] ) which in 
this case would imply that 



a^{r,R) < ai{r, R)a4{r, R) 
The RSW Theorem III. II can be used to show that 

ai{r,R) < {r/RY 



(VI.7) 



for some positive a. By Theorem VI. 4, we are done. [See [ |GPS10] . Section 2.2 as well 
as the appendix] for more on these bounds.] □ 



Combining (VI.5) with (VI.6), one obtains 



Ie(/n) < 0(1)04(^0)04(2%, ni)a^+(2ni,ra) 



< 0(1)04(^1) 



n 



where in the last inequality we used quasi-multiplicativity (Proposition II. 3) as well as 
the bound given by (VI.4). 
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Recall that we want an upper 
bound on H(/„) = Y.Mfnf- In 
this sum over edges e E Rn, let us 
divide the set of edges into dyadic 
annuli centered around the 4 cor- 
ners as in the next picture. 



Notice that there are 0(1)2^ edges in an annulus of radius 2 . This enables us to 




bound H(/„) as follows: 



log2 n+0(l) 



j^uur < 0(1) ("^(2' 



2feN 2 



n 



< 0(1) \ V 2^'=a4(2'^) 



(VI.8) 



fc<log2 n+0(l) 

It now remains to obtain a good upper bound on 04 (-R), for all i? > 1. 



2.2 An upper bound on the four- arm event in 1? 

This turns out to be a rather non-trivial problem. Recall that we obtained an easy 
lower bound on using (and Lemma VI. 5 strengthens this lower bound). For an 
upper bound, completely different ideas are required. On Z^, the following estimate is 
available for the 4- arm event. 

Proposition VI. 6. For critical percolation on 1? , there exists constants e,C > such 
that for any R> 1, one has 



a,{l,R)<c(^-) 



1 N 1 + . 



Before discussing where such an estimate comes from, let us see that it indeed 
implies a polynomial decay for H(/„). 

Recall equation (VI.8). Plugging in the above estimate, this gives us 



YWn? < 0(1)1 y: 2^'^(2 



-2-2e 



e^Rn 



fc<log2 n+0(l) 



< 0(1)- 



-n 



2-2e 



0{l)n 



-2e 
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which imphes the desired polynomial decay and thus the fact that {fn} is noise sensitive 



by Proposition V.5) 



Let us now discuss different approaches which enable one to prove Proposition VI.6 



(a) Kesten proved implicitly this estimate in his celebrated paper |Kes87] . His main 
motivation for such an estimate was to obtain bounds on the corresponding critical 
exponent which governs the so-called critical length. 

(b) In |BKS99j . in order to prove noise sensitivity of percolation using their criterion 
on H(/„), the authors referred to |Kes87] . but they also gave a completely different 
approach which also yields this estimate. 

Their alternative approach is very nice: finding an upper bound for 04 (i?) is 
related to finding an upper bound for the influences for crossings of a.n Rx R box. 
For this, they noticed the following nice phenomenon: if a monotone function / 
happens to be very little correlated with majority, then its influences have to be 
small. The proof of this phenomenon uses for the first time in this context the 
concept of "randomized algorithms". For more on this approach, see Chapter 



VIII which is devoted to these types of ideas. 



(c) In |SS10bj . the concept of randomized algorithms is used in a more powerful way. 



See again Chapter |VIII[ In this chapter, we provide a proof of this estimate in 
Proposition |VIII.8 



Remark VI. 1. It turns out that that a multi-scale version of Proposition VI.6 stating 
that 0:4 (r,i?) < Clj^] is also true. However, none of the three arguments given 
above seem to prove this stronger version. A proof of this stronger version is given in 
the appendix of |SS10aj . Since this multi-scale version is not needed until Chapter |X| 
we stated here only the weaker version. 



3 Some other consequences of our study of influ- 
ences 

In the previous sections, we handled the boundary effects in order to check that H(/„) 
indeed decays polynomially fast. Let us list some related results implied by this analysis. 



3.1 Energy spectrum of 

We start by a straightforward observation: since the /„ are monotone, we have by 

M{x}) = luU), 



Proposition IV.4 that 
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for any site x (or edge e) in Therefore, the bounds we obtained on H(/„) imply the 
following control on the first layer of the energy spectrum of the crossing events {fn}- 

Corollary VI. 7. Let {fn} be the crossing events of the rectangles Rn- 

• If we are on the triangular lattice T, then we have the hound 

\s\=i 

• On the square lattice 1? , we end up with the weaker estimate 

EfSl)<Cn-% 

for some e, C > 0. 
3.2 Sharp threshold of percolation 

The above analysis gave an upper bound on ^fclfc(/n)^- As we have seen in the first 
chapters, the total influence I(/n) = is also a very interesting quantity. Recall 

that, by Russo's formula, this is the quantity which shows "how sharp" the threshold 
is for p^Pp [/„ = !]. 

The above analysis allows us to prove the following. 

Proposition VI. 8. Both on T and 7? , one has 

Ufn) ^ n^aA{n) . 
In particular, this shows that on T that 

I(/„) X n3/^+°(^) . 

Remark VI.2. Since /„ is defined on { — 1, note that the Majority function 
defined on the same hypercube has a much sharper threshold than the percolation 
crossings /„. 

Proof. We first derive an upper bound on the total influence. In the same vein (i.e.. 



using dyadic annuli and quasi-multiphcativity) as we derived (VI.8) and with the same 
notation one has 



n 



I(/.) = 5^Ie(/n) < Y.O(l)a,{n,] 

e e 

< Oil)- V 23%(2* 



n 

fc<log2n+0(l) 
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Now, and this is the main step here, using quasi-multiphcativity one has 04(2'^) < 

' a4(2'=,n) ' 



'^(l)^^fe' ^hich giv^s us 



i(/„) < 0(1)^^ j: 



■,3k 



fc<log2n+0(l) ^ ' ' 



,2 



< 0(1)^ 2^'!^ since Mr, i?)>a5(r,i?)x (r/i?)-2 

fc<log2 n+0(l) 

< 0(l)na4H ^ 1^ 

fc<log2 n+0(l) 

< 0(l)n2a4(n) 

as desired. 

For the lower bound on the total influence, we proceed as follows. One obtains 
a lower bound by just summing over the influences of points whose distance to the 
boundary is at least n/4. It would suffice if we knew that for such edges or hexagons, 
the influence is at least a constant times 04(72). This is in fact known to be true. It 
is not very involved and is part of the folklore results in percolation. However, it still 
would lead us too far from our topic. The needed technique is known under the name 
of separation of arms and is clearly related to the statement of quasi-multiplicativity. 
See |Wer07j for more details. □ 



4 Quantitative noise sensitivity 

In this chapter, we have proved that the sequence of crossing events {/„} is noise 
sensitive. This can be roughly translated as follows: for any fixed level of noise e > 
0, as n — > 00, the large scale clusters of u in the window [0,72]^ are asymptotically 
independent of the large clusters of u^. 

Remark VI. 3. Note that this picture is correct, but in order to make it rigorous, this 
would require some work, since so far we only worked with left-right crossing events. 
The non-trivial step here is to prove that in some sense, in the scaling limit n — t- 00, any 
macroscopic property concerning percolation (e.g., diameter of clusters) is measurable 
with respect to the cx-algebra generated by the crossing events. This is a rather subtle 
problem since we need to make precise what kind of information we keep in what we 
call the "scaling limit" of percolation (or subsequential scaling limits in the case of 
Z^). An example of something which is not present in the scaling limit is whether one 
has more open sites than closed ones since by noise sensitivity we know that this is 
asymptotically uncorrelated with crossing events. We will not need to discuss these 
notions of scaling limits more in these lecture notes, since the focus is mainly on the 
discrete model itself including the model of dynamical percolation which is presented 
at the end of these lecture notes. 
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At this stage, a natural question to ask is to what extent the percolation picture is 
sensitive to noise. In other words, can we let the noise e = e„ go to zero with the "size 
of the system" n, and yet keep this independence of large scale structures between u 
and cUe^ ? If yes, can we give quantitative estimates on how fast the noise e = e„ may 
go to zero? One can state this question more precisely as follows. 

Question VI. 1. // {/«} denote our left-right crossing events, for which sequences of 
noise-levels {e„} do we have 

lim Cov[/„,(w), fniuJeJ] = ? 

n— >oo 

The purpose of this section is to briefly discuss this question based on the results 
we have obtained so far. 

4.1 Link with the energy spectrum of {/„} 



It is an exercise to show that Question VI. 1| is essentially equivalent to the following 
one. 

Question VI. 2. For which sequences {kn} going to infinity do we have 

m=l l<|S|<fc„ 

Recall that we have already obtained some relevant information on this question. 
Indeed, we have proved in this chapter that H(/„) = ^^Ia;(/n)^ decays polynomially 



fast towards (both on 1? and T). Therefore Proposition V.5 tells us that for some 
constant c > 0, one has for both T and 7? that 

E /"(^)'^0. (VI.9) 

l<|5|<clogn 



Therefore, back to our original question VI. 1 , this gives us the following quantitative 



statement: if the noise e„ satisfies e„ ^ j;^, then fnioj) and fni^tn) are asymptotically 
independent. 



4.2 Noise stabiUty regime 

Of course, one cannot be too demanding on the rate of decay of {e„}. For example if 
^ then in the window [0, n]^, with high probability, the configurations u and u^^^ 
are identical. This brings us to the next natural question concerning the noise stability 
regime of crossing events. 
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Question VI. 3. Let {fn} be our sequence of crossing events. For which sequences {e„} 
do we have 

F[Uuj)^fnM] ^0? 

n— >-oo 

It is an exercise to show that this question is essentially equivalent to the following 
one. 

For which sequences {kn} do we have 

E fniSf ^ ? 

\S\>kn 

Using the estimates of the present chapter, one can give the following non-trivial 
bound on the noise stability regime of {fn}- 

Proposition VI. 9. Both on 1? and T, if 

en^o(^—) 
" Vn^a4(n)/ ' 

then 

P[fn{co) fniuJej] ^ 

n— >-oo 

On the triangular grid, using the critical exponent, this gives us a bound of on 
the noise stability regime of percolation. 

Proof. Let {e„} be a sequence satisfying the above assumption. There are O(n^) bits 
concerned. For simplicity, assume that there are exactly bits. Let us order these in 
some arbitrary way: {xi, . . . , x„2} (or on Z^, {ci, . . . , e„2}). 

Let cu = cuq = {xi, . . . ,a:„2) be sampled according to the uniform measure. Recall 
that the noised configuration Ue„ is produced as follows: for each i G [n^] , resample the 
bit Xi with probability e^, independently of everything else, obtaining the bit y^. (In 
particular tji ^ Xi with probability Cn/'i). 

Now for each i e [n^\ define the intermediate configuration 

•^i (yi, ■ ■ ■,yi,Xi+i, . . . ,x„2) 

Notice that for each i G [n^], Ui is also sampled according to the uniform measure 
and one has for each i G {1, . . . , n^} that 
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Summing over all i, one obtains 

n2-l 

< 5^P[/„(a;,)^/n(^m)] 



which concludes the proof. 



i=0 



(e„/2) J2l.Xfn 



i=l 



= (e„/2)I(/0 

< e„0(l)n^a;4(ra) by Proposition [VTS 



□ 



4.3 Where does the spectral mass Ues? 



Proposition VI. 9 (together with Exercise IX. 2 in Chapter IX) implies that the Fourier 



coefficients of {/„} satisfy 



J2 Usf-^o. 



(VI.IO) 



|5|>n2a4(n) 



From Lemma VI. 5, we know that even on Z^, n^Q;4(n) is larger than for some 
exponent e > 0. Combining the estimates on the spectrum that we achieved so far 



(equations (VI. 9) and (VI.IO)), we see that in order to localize the spectral mass of 



{/„}, there is still a missing gap. See Figure VI. 1 



For our later applications to the model of dynamical percolation (in the last chapter 
of these lecture notes), a better understanding of the noise sensitivity of percolation 
than the "logarithmic" control we achieved so far will be needed. 



70 CHAPTER VI. FIRST EVIDENCE OF NOISE SENSITIVITY OF PERCOLATION 




k 



clogn ^3/4+0(1) 



Figure VI. 1: This picture summarizes our present knowledge of the energy spectrum 
of {/n} on the triangular lattice T. Much remains to be understood to know where, in 
the range [il{\ogn) , n^^'^~^°^^^], the spectral mass lies. This question will be analyzed in 
the following chapters. 



Exercise sheet on chapter 



Instead of being the usual exercise sheet, this page will be devoted to a single Problem 
whose goal will be to do "hands-on" computations of the first layers of the energy 



spectrum of the percolation crossing events /„. Recall from Proposition IV. 1 that a 
sequence of Boolean functions {/n} is noise sensitive if and only if for any fixed k > 1, 



m=l |S'|=m 



^m) — > . 

n— >oo 



m=l 



In the present chapter, we obtained (using Proposition IV. 4) that this is indeed 
the case for k = 1. The purpose here is to check by simple combinatorial arguments 
(without relying on hypercontractivity) that it is still the case for k = 2 and to convince 
ourselves that it works for all layers A; > 3. 

To start with, we will simplify our task by working on the torus 1? jnl? . This has 
the very nice advantage that there are no boundary issues here. 



Energy spectrum of crossing events on the torus (study 
of the first layers) 

Let Tn be either the square grid torus 1? jnT? or the triangular grid torus T/nT. Let 
/„ be the indicator of the event that there is an open circuit along the first coordinate 
of T„. 

1. Using RSW, prove that there is a constant c > such that for all n > 1, 

C<P[/n = l] < 1-C. 

(In other words, {/„} is non-degenerate.) 

2. Show that for all edges e (or sites x) in T„ 

T) 

Ufn)<M-). 
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3. Check that the BKS criterion (about H(/„)) is satisfied. Therefore {fn} is noise- 
sensitive 

From now on, one would hke to forget about the BKS Theorem and try to do 
some hands-on computations in order to get a feehng why most frequencies should 
be large. 

4. Show that if x, y are two sites of T„ (or similarly if e, e' are two edges of T„), then 

\f{{x,y})\ < 2P[ X and y are pivotal points] . 
Does this result hold for general Boolean functions? 

5. Show that if d := \x — y\, then 

04(^/2)2 



P[ X and y are pivotal points ] < 0(1) 



(Hint: use Proposition II. 3 



6. On the square lattice Z^, by carefully summing over all edges e, e' G T„ x T„, show 
that 

Ej^{2) = J2 lis? < 0{l)n-\ 

\S\=2 

for some exponent e > 0. 

Hint: you might decompose the sum in a dyadic way (as we did many times in 
the present section) depending on the mutual distance d{e,e'). 

7. On the triangular grid, what exponent does it give for the decay of Ej^(2)? Com- 



pare with the decay we found in Corollary VI. 7 about the decay of the first layer 



Ef^{l) (i.e. k = 1). See also Lemma V.6 in this regard. Discuss this. 



8. For T, what do you expect for higher (fixed) values of fc? (I.e. for Ef^{k), k > 3) 
? 

9. (Quite hard) Try to obtain a nonrigorous combinatorial argument similar to the 
one above in the particular case k = 2, that for any fixed layer k > 1, 



EfM ^ 0. 

n— >oo 

This would give us an alternative proof of noise sensitivity of percolation (at least 



in the case of the torus T„) not relying on Theorem 1.5 



Observe that one can do similar things for rectangles but then one has to deal with 
boundary issues. 



Chapter VII 
Anomalous fluctuations 



In this lecture, our goal is to extend the technology we used to prove the KKL Theorems 
on influences and the BKS Theorem on noise sensitivity to a slightly different context: 
the study of fluctuations in first passage percolation. 

1 The model of first passage percolation 

Let us flrst explain what the model is. Let < a < 6 be two positive numbers. We 
deflne a random metric on the graph Z"^, > 2 as follows. Independently for each 
edge e G E"', flx its length Te to be a with probability 1/2 and b with probability 1/2. 
This is represented by a uniform conflguration u G { — 1,1}'^ . 

This procedure induces a well-deflned (random) metric dist^ on in the usual 
fashion. For any vertices x,y & Z'^, let 

distu;{x,y) := inf |^re^(cj)|. 

paths 7 = {ei,. . . ,efc} 
connecting x ^ y 

Remark Vll.l. In greater generality, the lengths of the edges are i.i.d. non-negative 
random variables, but here, following |BKS03j . we will restrict ourselves to the above 
uniform distribution on {a, b} to simplify the exposition; see |BR08j for an extension 
to more general laws. 

One of the main goals in flrst passage percolation is to understand the large-scale 
properties of this random metric space. For example, for any T > 1, one may consider 
the (random) ball 

B^{x,T) := {yeZ'': dist,(x,i/) < T}. 

To understand the name first passage percolation, one can think of this model as 
follows. Imagine that water is pumped in at vertex x, and that for each edge e, it takes 
Te{uj) units of time for the water to travel across the edge e. Then, Bi^{x,T) represents 
the region of space that has been wetted by time T. 
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Figure VII. 1: A sample of a wetted region at time T, i.e. B^{x,T), in first passage 
percolation. 

An application of subadditivity shows that the renormalized ball |;i?^(0,T) con- 
verges as T — )■ cxD towards a deterministic shape which can in certain cases be com- 
puted explicitly. This is a kind of "geometric law of large numbers". Whence the 
natural question: 

Question VII. 1. Describe the fluctuations of Bi^{0,T) around its asymptotic deter- 
ministic shape. 

This question has received tremendous interest in the last 15 years or so. It is 
widely believed that these fluctuations should be in some sense "universal". More 
precisely, the behavior of Bi^{0, T) around its limiting shape should not depend on the 
"microscopic" particularities of the model such as the law on the edges lengths but only 
on the dimension d of the underlying graph. The shape itself depends on the other hand 
of course on the microscopic parameters, in the same way as the critical point depends 
on the graph in percolation. 

In the two-dimensional case, using very beautiful combinatorial bijections with ran- 
dom matrices, certain cases of directed last passage percolation (where the law on the 
edges is taken to be geometric or exponential) have been understood very deeply. For 
example, it is known (see |JohOO] ) that the fluctuations of the ball of radius n (i.e. 
the points whose last passage times are below n) around n times its asymptotic deter- 
ministic shape are of order n}/^ and the law of these fluctuations properly renormalized 
follows the Tracy- Widom distribution. Very interestingly, the fluctuations of the largest 
eigenvalue of GUE ensembles also follow this distribution. 
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2 State of the art 

Returning to our initial model of (non-directed) first passage percolation, it is thus 
conjectured that, for dimension d = 2, fiuctuations are of order n^^^ following a Tracy- 
Widom Law. Still, the current state of understanding of this model is far from this 
conjecture. 

Kesten first proved that the fiuctuations of the ball of radius n are at most ^/n (this 
did not yet exclude a possible Gaussian behavior with Gaussian scaling). Benjamini, 
Kalai and Schramm then strengthened this result by showing that the fiuctuations are 
sub-Gaussian. This is still far from the conjectured n^/'^-fiuctuations, but their approach 
has the great advantage of being very general; in particular their result holds in any 
dimension d >2. 

Let us now state their main theorem concerning the fiuctuations of the metric dist. 

Theorem VII. 1 ( |BKS03] ). For all a,b,d, there exists an absolute constant C = 
C{a, b, d) such that in TL^ , 

Var(dist^(0,t;)) < 

log \v\ 

for any v G Z^, \v\ > 2. 

To keep things simple in these notes, we will only prove the analogous statement on 
the torus where one has more symmetries and invariance to play with. 



3 The case of the torus 

Let be the d-dimensional torus (Z/mZ)*^. As in the above lattice model, indepen- 
dently for each edge of T^, we choose its length to be either a or 6 equally likely. We are 
interested here in the smallest length among all closed paths 7 "winding" around the 
torus along the first coordinate Z/mZ (i.e. those paths 7 which when projected onto 
the first coordinate have winding number one). In |BKS03] . this is called the shortest 
circumference. For any configuration tu G {a,b}^^'^"^\ this shortest circumference is 
denoted by Circm(c<;). 

Theorem VII. 2 ( |BKS03j ). There is a constant C = C{a,b) (which does not depend 
on the dimension d), such that 

Tfl 

var(CirCm(w)) < C- . 

logm 

Remark VIL2. A similar analysis as the one carried out below works in greater gen- 
erality: ii G = {y,E) is some finite connected graph endowed with a random metric 
d^ with oj G {a, 6}®'^, then one can obtain bounds on the fiuctuations of the random 
diameter D = D^^ of {G,di^). See |BKS03| Theorem 2] for a precise statement in this 
more general context. 
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Figure VII. 2: The shortest geodesic along the first coordinate for the random metric 
dist(^ on (Z/mZ)^. 

Proof. 

For any edge e, let us consider the gradient along the edge e: VeCirCm- These 
gradient functions have values in [—{b — a),b — a]. By dividing our distances by the 
constant factor b — a, we can even assume without loss of generality that our gradient 
functions have values in [—1, 1]. Doing so, we end up being in a setup similar to the 
one we had in Chapter |V| The influence of an edge e corresponds here to le(CirCm) := 
P[VeCirc™(w) ^0]. We will prove later on that Circm has very small influences. In 
other words, we will show that the above gradient functions have small support, and 
hypercontractivity will imply the desired bound. 

We have thus reduced the problem to the following general framework. Consider a 
real- valued function / : { — 1, 1}" — )■ M, such that for any variable k, V^/ G [—1, 1]. We 
are interested in Var(/) and we want to show that if "influences are small" then Var(/) 
is small. It is easy to check that the variance can be written 




If all the variables have very small influence, then, as previously, V^/ should be of high 
frequency. Heuristically, this should then imply that 

Var(/) « 

k 



3. THE CASE OF THE TORUS 
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This intuition is quantified by the following lemma on the link between the fluctu- 
ations of a real-valued function / on fi„ and its influence vector. 

Lemma VII. 3. Let / : fi^ — )■ M 6e a (real-valued) function such that each of its discrete 
derivatives V^/, k G [n] have their values in [—1, 1]. Let lk{f) '■= PfVfc/ 7^ O] be the 
influence of the kth bit. Assume that the influences of f are small in the sense that 
there exists some a > such that for any k E {I, . . . ,n}, lk{f) < n'"'. Then there is 
some constant C = C{a) such that 

Var(/)<-^5^I,(/). 
logn ^-^ 



Remark VII. 3. If / is Boolean, then this follows from Theorem 1.3 with C{a) = c/a 
with c universal. 

The proof of this lemma is postponed to the next section. In the meantime, let us 
show that in our special case of first passage percolation on the torus, the assumption 
on small influences is indeed verified. Since the edge lengths are in {a, b}, the smallest 
contour CirCm(to') in around the first coordinate lies somewhere in [am,bm]. Hence, 
if 7 is a geodesic (a path in the torus with the required winding number) satisfying 
length(7) = CirCm(c<;), then 7 uses at most edges. There might be several different 
geodesies minimizing the circumference. Let us choose randomly one of these in an 
"invariant" way and call it 7. For any edge e G i?(T^), if, by changing the length of 
e, the circumference increases, then e has to be contained in any geodesic 7, and in 
particular in 7. This implies that P[VeCirCm,(w) > O] < P[e G 7]. By symmetry we 
obtain that 

le(Circ^) = P[VeCirc™(u;) 7^ O] < 2P[e G 7] ■ 

Now using the symmetries both of the torus TJ^ and of our observable Circm, if 7 
is chosen in an appropriate invariant way (uniformly among all geodesies for instance), 
then it is clear that all the "vertical" edges (meaning those edges which, when projected 
onto the first coordinate, project onto a single vertex) have the same probability to lie 
in 7. The same is true for the "horizontal" edges. In particular we have that 

P[e G 7] < E[|7|] < -m. 

"vertical" edges e 

Since there are at least order m'^ vertical edges, the influence of each of these is bounded 
by 0{l)m^^'^. The same is true for the horizontal edges. All together this gives the 



desired assumption needed in Lemma |VII.3[ Applying this lemma, we indeed obtain 
that 

m 

Var(Circ„(a;)) <0(1)- , 

logm 

where the constant does not depend on the dimension d; the dimension in fact helps us 
here, since it makes the influences smaller. □ 
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Remark VII. 4. At this point, we know that for any edge e, le(CirCm) = O(^). Hence, 
at least in the case of the torus, one easily deduces from Poincare's inequality the 
theorem by Kesten which says that Var(CirCm) = 0{m). 



4 Upper bounds on fluctuations in the spirit of KKL 



In this section, we prove Lemma VII. 3 



Proof. Similarly as in the proofs of Chapter |V} the proof relies on implementing 
hj^ercontractivity in the right way. We have that for any c. 



0(1) 



k 0<|S|<clogn ° k 

where the 0(1) term depends on the choice of c. 

Hence it is enough to bound the contribution of small frequencies, < 15*1 < clogn, 
for some constant c which will be chosen later. As previously we have for any p G (0, 1) 
and using hypercontractivity. 



J2 E ^(^)' ^ p-'^^°'"Eii^p^^/ii 

k 0<|5|<clogn k 



k 
k 

^ k 

< p~^^^°^"?T, "i+p^ by our assumption . 

k 

(VII. 1) 

Now fixing any p G (0, 1), and then choosing the constant c depending on p and a, 
the lemma follows. By optimizing on the choice of p, one could get better constants if 
one wants. □ 



5 Further discussion 

Some words on the proof of Theorem VII. 1 



5. FURTHER DISCUSSION 
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The main difficulty here is that the quantity of interest, /(w) := dist(^(0,f), is no 
longer invariant under a large class of graph automorphisms. This lack of symmetry 
makes the study of influences more difficult. For example, edges near the endpoints 
or V have very high influence (of order one). To gain some more symmetry, the authors 
in |BKS03j rely on a very nice "averaging" procedure. We refer to this paper for more 
details. 

Known lower bounds on the fluctuations 

We discussed mainly here ways to obtain upper bounds on the fluctuations of the 
shapes in first passage percolation. It is worth pointing out that some non-trivial lower 
bounds on the fluctuations are known for Z^. See |PP94t INP95j . 

Remark VII. 5. We end by mentioning that the proof given in |BKS03j was based on 
an inequality by Talagrand. The proof given here avoids this inequality. 
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Exercise sheet of Chapter |VII 



Problem VII. 1. Let n > 1 and d > 2. Consider the random metric on the torus 
TJ^jnL^ as described in this chapter. For any > 1, let J^^ be the event that the 
shortest "horizontal" circuit is < fc. If d > 3, show that for any choice of fc„ = 
the family of events Jt^ is noise sensitive. (Note that the situation here is similar 
to the Problem 1.9 in Chapter [ij) Finally, discuss the two-dimensional case, d = 2 
(non-rigorously) . 



Exercise VII. 2. Show that Lemma VII. 3 is false if Ifc(/) is taken to be the square of the 
norm of Vfc/ rather than the probability of its support (i.e. find a counterexample). 
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Chapter VIII 

Randomized algorithms and noise 
sensitivity 



In this chapter, we explain how the notion of revealment for so-called randomized 
algorithms can in some cases yield direct information concerning the energy spectrum 
which may allow not only noise sensitivity results but even quantitative noise sensitivity 
results. 



1 BKS and randomized algorithms 



In the previous chapter, we explained how Theorem L5 together with bounds on the 
pivotal exponent for percolation yields noise sensitivity for percolation crossings. How- 
ever, in |BKS9 9I|. a different approach was in fact used for showing noise sensitivity 



which, while still using Theorem 1.5, did not use these bounds on the critical expo- 
nent. In that approach, one sees the first appearance of randomized algorithms. In a 
nutshell, the authors showed that (1) if a monotone function is very uncorrelated with 
all majority functions, then it is noise sensitive (in a precise quantitative sense) and 
(2) percolation crossings are very uncorrelated with all majority functions. The latter 



is shown by constructing a certain algorithm which, due to the RSW Theorem II. 1 
looks at very few bits but still looks at enough bits in order to be able to determine the 
output of the function. 



2 The revealment theorem 

An algorithm for a Boolean function / is an algorithm A which queries (asks the values 
of) the bits one by one, where the decision of which bit to ask can be based on the 
values of the bits previously queried, and stops once / is determined (being determined 
means that / takes the same value no matter how the remaining bits are set). 

A randomized algorithm for a Boolean function / is the same as above but 



83 



84CH AFTER VIII. 



RANDOMIZED ALGORITHMS AND NOISE SENSITIVITY 



auxiliary randomness may also be used to decide the next value queried (including for 
the first bit). [In computer science, the term randomized decision tree would be used 
for our notion of randomized algorithm, but we will not use this terminology.] 

The following definition of revealment will be crucial. Given a randomized algorithm 
A for a Boolean function /, we let Ja denote the random set of bits queried by A. 
(Note that this set depends both on the randomness corresponding to the choice of u 
and the randomness inherent in running the algorithm, which are of course taken to be 
independent.) 

Definition VIII. 1. The revealment of a randomized algorithm A for a Boolean 
function f , denoted by 6a, is defined by 

6a '■= maxP(2 G Ja)- 

ie[n] 

The revealment of a Boolean function /, denoted by 6f, is defined by 

6f := inf 6a 

where the infimum is taken over all randomized algorithms A for f . 

This section presents a connection between noise sensitivity and randomized al- 
gorithms. It will be used later to yield an alternative proof of noise sensitivity for 



percolation crossings which is not based upon Theorem 1.5 (or Proposition V.5). Two 



other advantages of the algorithmic approach of the present section over that mentioned 



in the previous section (besides the fact that it does not rest on Theorem 1.5) is that 
it applies to nonmonotone functions and yields a more "quantitative" version of noise 
sensitivity. 

We have only defined algorithms, randomized algorithms and revealment for Boolean 
functions but the definitions immediately extend to functions / : i7„ — )■ M. 
The main theorem of this section is the following. 

Theorem VIII. 1 ( |SS10bj ). For any function / : fi„ — )■ M and for each k = 1,2, ... , 
we have that 

Efik)= Yl hsr<6fk\\fr, (vm.i) 

SC[n], \S\=k 

where \\f\\ denotes the L"^ norm of f with respect to the uniform probability measure on 
fl and 6f is the revealment of f . 



Before giving the proof, we make some comments to help the reader see what is hap- 
pening and suggest why a result like this might be true. Our original function is a 
sum of monomials with coefficients given by the Fourier coefficients. Each time a bit 
is revealed by the algorithm, we obtain a new Boolean function obtained by just sub- 
stituting in the value of the bit we obtained into the corresponding variable. On the 
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algebraic side, those monomials which contain this bit go down by 1 in degree while the 
other monomials are unchanged. There might however be cancellation in the process 
which is what we hope for since when the algorithm stops, all the monomials (except the 
constant) must have been killed. The way cancellation occurs is illustrated as follows. 
The Boolean function at some stage might contain {l/3)x2X4^X5 + (l/3)a;2a;4 and then 
the bit X5 might be revealed and take the value —1. When we substitute this value into 
the variable, the two terms cancel and disappear, thereby bringing us 1 step closer to 
a constant (and hence determined) function. 

As far as why the result might be true, the intuition, very roughly speaking, is as 
follows. The theorem says that for a Boolean function we cannot, for example, have 
5 = 1/1000 and /({^})^ = 1/2- If the level 1 monomials of the function were 

aiCOi + a2UJ2 + ■ ■ ■ + anCOn, 

then it is clear that after the algorithm is over, then with high probability, the sum of the 
squares of the coefficients of the terms which have not been reduced to a constant is still 
reasonably large. Therefore, since the function at the end of the algorithm is constant, 
these remaining terms must necessarily have been cancelled by higher degree monomials 
which, after running the algorithm, have been "reduced to" degree 1 monomials. If, for 
the sake of this heuristic argument, we assume that each bit is revealed independently, 
then the probability that a degree k > 2 monomial is brought down to a degree 1 
monomial (which is necessary for it to help to cancel the degree 1 terms described 
above) is at most S'''^ and hence the expected sum of the squares of the coefficients 
from the degree k >2 monomials which are brought down to degree 1 is at most 
The total such sum for levels 2 to n is then at most 

n 
k=2 

which won't be enough to cancel the (originally) degree 1 monomials which remained 
degree 1 after running the algorithm if S is much less than ^ similar 

heuristic works for the other levels. 

Proof. In the following, we let Q denote the probability space that includes the ran- 
domness in the input bits of / and the randomness used to run the algorithm (which we 
assume to be independent) and we let E denote the corresponding expectation. With- 
out loss of generality, elements of fl can be represented as a) = (w, r) where u are the 
random bits and r represents the randomness necessary to run the algorithm. 
Now, fix A; > 1. Let 

\S\=k 



The left hand side of (VIII.l) is equal to WgW^. 
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Let J C [n] be the random set of all bits examined by the algorithm. Let A denote 
the minimal a-field for which J is measurable and every Ui, i E J, is measurable; this 
can be viewed as the relevant information gathered by the algorithm. For any function 
/i : — 7- M, let /ij : r2 — 7- M denote the random function obtained by substituting the 
values of the bits in J. More precisely, if a) = (w, r) and u' G Q, then hj{Cj){uj') is h{(jj") 
where u" is u on J {Co) and is u' on [n\\J{Cj). In this way, hj is a random variable on Vt 
taking values in the set of mappings from f2 to M and it is immediate that this random 
variable is ^-measurable. When the algorithm terminates, the unexamined bits in Vt 
are unbiased and hence E[/i|^] = J hj{= hj(^)) where J is defined, as usual, to be 
integration with respect to uniform measure on fi. It follows that E[/i] = E[J hj\. 

Similarly, for all /i. 



(VIII.2) 



Since the algorithm determines /, it is ^ measurable, and we have 



E 



||(7f =E[(7/]=E[e[(7/|^] 
Since E^^f | ^] = gj{%), Cauchy-Schwarz therefore gives 

\\gr<^nhW]\\f\\- (VIII.3) 

We now apply Parseval's formula to the (random) function gj: this gives (for any 

mf = \\9j\\i - E ^^(^)'- 

\s\>o 

Taking the expectation over u E Cl, this leads to 

E[gjm'] = E[\\gj\\l]-J2^9ASr] 

\s\>o 



\g\\l-Y,^[9jiSf] byllVng 

|5|>0 



V g(S?- Y ElgAS?] . , , , a, ■ . 
^\ J I \ \ on level-K coefncients 



\S\=k 



|5|>0 



since g is supported 



< 



Y^E{g{Sf - gASf\ 

\S\=k 



by restricting to 
level-fc coefficients 



Now, since gj is built randomly from g by fixing the variables in J = J(a)), and 
since g by definition does not have frequencies larger than fc, it is clear that for any S 
with 15*1 = A; we have 



g{S) = f{S), iiSnJ{oo) 
0, otherwise. 
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Therefore, we obtain 



\E[g\j]\\l = E[gM'] < 9iS)'¥[S H J ^ ^ < \\g\\lk5. 

\S\=k 



Combining with (VIII. 3) completes the proof. 



□ 



Proposition IV. 1 and Theorem VIII. 1 immediately imply the following corollary. 



Corollary VIII. 2. // the revealments satisfy 



lim 6f„=0, 



then {fn} is noise sensitive. 



In the exercises, one is asked to show that certain sequences of Boolean functions are 
noise sensitive by applying the above corollary. 



3 An application to noise sensitivity of percolation 



In this section, we apply Corollary VIII. 2 to prove noise sensitivity of percolation cross- 
ings. The following result gives the necessary assumption that the revealments approach 
0. 

Theorem VIII. 3 ( |SS10b] ). Let f = fn be the indicator function for the event that 
critical site percolation on the triangular grid contains a left to right crossing of our 
n X n box. Then Sf^ < n~^^'^~^°^^^ as n oo. 

For critical bond percolation on the square grid, this holds with 1/4 replaced by some 
positive constant a > 0. 

Outline of Proof. We outline the argument only for the triangular lattice; the argu- 
ment for the square lattice is similar. We first give a first attempt at a good algorithm. 
We consider from Chapter [IT] the exploration path or interface from the bottom right 
of the square to the top left used to detect a left right crossing. This (deterministic) 
algorithm simply asks the bits that it needs to know in order to continue the interface. 
Observe that if a bit is queried, it is necessarily the case that there is both a black and 
white path from next to the hexagon to the boundary. It follows, from the exponent of 
1/4 for the 2- arm event in Chapter |ll| that, for hexagons far from the boundary, the 
probability that they are revealed is at most as desired. However, one cannot 

conclude that points near the boundary have small revealment and of course the right 
bottom point is always revealed. 

The way that we modify the above algorithm so that all points have small revealment 
is as follows. We first choose a point x at random from the middle third of the right 
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side. We then run two algorithms, the first one which checks whether there is a left 
right path from the right side above x to the left side and the second one which checks 
whether there is a left right path from the right side below x to the left side. The first 
part is done by looking at an interface from x to the top left corner as above. The 
second part is done by looking at an interface from x to the bottom left corner as above 
(but where the colors on the two sides of the interface need to be swapped.) 

It can then be shown with a little work (but no new conceptual ideas) that this 
modified algorithm has the desired revealment of at most as desired. One 

of the things that one needs to use in this analysis is the so-called one-arm half-plane 
exponent, which has a known value of 1/3. See |SS10b] for details. □ 





3.1 First quantitative noise sensitivity result 

In this subsection, we give our first "polynomial bound" on the noise sensitivity of per- 
colation. This is an important step in our understanding of quantitative noise sensitivity 



of percolation initiated in Chapter VI 



Recall that in the definition of noise sensitivity, e is held fixed. However, as we have 



seen in Chapter VI, it is of interest to ask if the correlations can still go to when 
e = en goes to with n but not so fast. The techniques of the present chapter imply 
the following result. 



Theorem VIII.4 ( |SS10bj ). Let {/„} be as in Theorem 
lattice, for all 'y < 1/8, 



VIII. 3\ Then, for the triangular 

(VIII.4) 



lim E[fn{0o)fn{0Ol/n.)] " Wnioo)]^ = 0. 
n—^oo 

On the square lattice, there exists some 7 > with the above property. 

Proof. We prove only the first statement; the square lattice case is handled similarly. 



First, (IV. 3) gives us that every n and 7, 



(VIII.5) 



k=l 
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Note that there are order terms in the sum. Fix 7 < 1/8. Choose e > so that 
7 + e < 1/8. For large n, we have that 5/„ < l/n^^^~^. The right hand side of (VIII. 5) 
is at most 

J2 fc/ni/4- + (l-l/nT'^^'' 

k=l 



by breaking up the sum at rC^'^f'^ and applying Theorems VIII. 1 and VIII. 3 to bound 
the Ef^{k) terms in the first part. The second term clearly goes to while the first 
part also goes to by the way e was chosen. □ 

Observe that the proof of Theorem VIII. 4| immediately yields the following general 
result. 

Corollary VIII. 5. Let {fn} be a sequence of Boolean functions on rUn bits with 6{fn) < 
0(1)/^^ for all n. Then for all 7 < (5/2, we have that 



lim E[/„(u;)/„(a;i/„.)] - E[/„(a;)]2 = 0. 



(VIII.6) 



4 Lower bounds on revealments 

One of the goals of the present section is to show that one cannot hope to reach the 



conjectured 3/4-sensitivity exponent with Theorem VIII. 1 Theorem VIII. 4 told us 
that we obtain asymptotic decorrelation if the noise is 1/n"' for 7 < 1/8. Note that 
this differs from the conjectured "critical exponent" of 3/4 by a factor of 6. In this 
section, we investigate the degree to which the 1/8 could potentially be improved and 
in the discussion, we will bring up an interesting open problem. We will also derive 
an interesting general theorem giving a nontrivial lower bound on the revealment for 
monotone functions. We start with the following definition. 

Definition VIII. 2. Given a randomized algorithm A for a Boolean function f , let 
C{A) (the cost of A) be the expected number of queries that the algorithm A makes. Let 
C{f) (the cost of f) be the infimum of C{A) over all randomized algorithms A for f . 

Remark VIII. 1. (i). It is easy to see that C{f) is unchanged if we take the infimum 
over deterministic algorithms. 

(ii) . Clearly tiSa > C{A) and hence n5f > C{f). 

(iii) . C{f) is at least the total influence I(/) since for any algorithm A and any i, the 
event that i is pivotal necessarily implies that the bit i is queried by A. 

The following result due to O'Donnell and Servedio ( jOS07] )is an essential improve- 
ment on the third part of the last remark. 

Theorem VIII.6. Let f be a monotone Boolean function mapping Qn into { — 1, 1}. 
Then C{f) > l{ff and hence 5f > I(/)Vn. 
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Proof. Fix any randomized algorithm A for /. Let J 
queried by A. We then have 



Ja be the random set of bits 



where the first equahty uses monotonicity (recall Proposition IV. 4) and then the Cauchy- 



Schwarz inequality is used. We now bound the first term by 1. For the second moment 
inside the second square root, the sum of the diagonal terms yields IE[|t/|] while the 
cross terms are all since for i ^ j, K[u!iI^i^j}UjI^j^jy] = as can be seen by breaking 
up the sum depending on whether i or j is queried first. This yields the result. □ 

Returning to our event / „ of percolation crossings, since the sum of the influences 

,3/4+o(l) 



IS n 



Theorem 



VIII. 6| tells us that 6f^ > n i/2+o(i)_ follows from the method 



of proof in Theorem VIII. 4 that Theorem VIII. 1 cannot improve the result of Theorem 
VIII. 4 past 7 = 1/4 which is still a factor of 3 from the critical value 3/4. Of course. 



one could investigate the degree to which Theorem VIII. 1 itself could be improved. 

Theorem VIII. 3 tells us that there are al gorithms An for /„ such that C{An) < 
^7/4+o(i) ^Yie other hand, Theorem VIII. 6 tell us that it is necessarily the case that 
C{A) > n6/4+°(i). 

Open Question: Find the smallest a such that there are algorithms An for with 
CiAn) < rf. (We know a G [6/4, 7/4].) 

We mention another inequality relating revealment with influences which is a con- 
sequence of the results in |OSSS05] . 

Theorem VIII. 7. Let j he a Boolean junction mapping f2„ into { — 1, 1}. Then 6f > 
Var(/)/(nmaXi hif)) 



It is interesting to compare Theorems VIII. 6 and VIII. 7 Assuming Var(/) is of 
order 1, and all the influences are of order 1/n", then it is easy to check that Theorem 



VIII. 6 gives a better bound when a < 2/3 and Theorem VIII. 7 gives a better bound 
when a > 2/3. For crossings of percolation, where a should be 5/8, it is better to use 
Theorem IVIII.6I rather than IVIII.7I 

Finally, there are a number of interesting results concerning revealment obtained in 
the paper |BSW05j . Four results are as follows. 

1. If / is reasonably balanced on n bits, then the revealment is at least of order l/ra^/^. 

2. There is a reasonably balanced function on n bits whose revealment is at most 
0(l)(logn)/ni/2. 

3. If / is reasonably balanced on n bits and is monotone, then the revealment is at 
least of order 1/n^/^. 

4. There is a reasonably balanced monotone function on n bits whose revealment is at 
most 0(l)(log n)/n^/^. 

We finally end this section by giving one more reference which gives an interesting 
connection between percolation, algorithms and game theory; see |PSSW07] . 
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5 An application to a critical exponent 



In this section, we show how Theorem VIII. 1 or in fact Theorem |VIII.6 can be used to 
show that the 4- arm exponent is strictly larger than 1; recall that with SLE technology, 
this can be shown for the triangular lattice. 

Proposition VIII. 8. Both on the triangular lattice T and on 71? , there exists eo > 
such that 

MR) < l/R^^'° 



We will assume the separation of arms result mentioned earlier in Chapter VI which 
says that for the event Jr, the influence of any variable further than distance -R/10 
from the boundary, a set of variables that we will denote by B for bulk, is x a^lR). 



Proof. Theorems | VI 1 1. 3 and VIII. 1| imply that for some a > 0, 

i 

Next, using the separation of arms as explained above, we have 

R'al{R)< 0(1)^2^1 



(VIII.7) 



Proposition IV.4 then yields 



R'ai{R) < Oil/R") 



and the result follows. 



□ 



Observe that Theorem |VIII.6| could also be used as follows. Theorem |VIII.3| implies 
that Cifn) < for some a > and then Theorem fmL6\ yields I(/r)2 < R"^'". 



Exactly as in (VIII.7), one has, again using separation of arms, that 

R'a,iR)<0il)J2h<0imfn). 



(VIII.8) 



Altogether this gives us 



again yielding the result. 



R^ai{R) < 0{1)R 



2-a 



We finally mention that it is not so strange that either of Theorems VIII. 1 



or 



VIII. 6| can be used here since, as the reader can easily verify, for the case of monotone 
functions all of whose variables have the same influence, the case /c = 1 in Theorem 



VIII. 1 is equivalent to Theorem VIII. 6 



Remark VIII. 2. We now mention that the proof for the multi-scale version of Proposi- 



tion VI. 6 is an extension of the approach of O'Donnell and Servedio above. 
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6 Does noise sensitivity imply low revealment? 

As far as these lectures are concerned, this subsection will not connect to anything that 
follows and hence can be viewed as tangential. 

It is natural to ask if the converse of Corollary |VIII.2 might be true. A moment's 



thought reveals that example [2| Parity, provides a counterexample. However, it is more 
interesting perhaps that there is a monotone counterexample to the converse which is 
provided by example |5| Clique containment. 

Proposition VIII. 9. Clique containment provides an example showing that the con- 



verse of Corollary VIII. 2 is false for monotone functions. 



Outline of Proof. We first explain more precisely the size of the clique that we are 
looking for. Given n and fc, let f{n, k) := (^)2~^2)^ which is just the expected number 
of cliques of size in a random graph. When k is around 21og2(n), it is easy to check 
that /(n, k + l)//(n, k) is o(l) as n — )• oo. For such k, clearly if /(n, k) is small, then 
with high probability there is no fc-clique while it can be shown, via a second moment 
type argument, that if /(n, k) is large, then with high probability there is a fc-clique. 
One now takes /c„ to be around 21og2(n) such that /(n, A;„) > 1 and /(n, A;„ + 1) < 1. 
Since /(n, k + 1)/ f{n, k) is o(l), it follows with some thought from the above that the 
clique number is concentrated on at most 2 points. Furthermore, if /(n, is very large 
and /(n, kn + 1) very small, then it is concentrated on one point. Again, see |ASOO] for 
details. 

Finally, we denote the event that the random graph on n vertices contains a clique 
of size kn by A^. We have already seen in one of the exercises that this example is 
noise sensitive. We will only consider a sequence of n's so that A^ is nondegenerate in 
the sense that the probabilities of this sequence stay bounded away from and 1. An 
interesting point is that there is such a sequence. Again, see |AS00j for this. To show 
that the revealments do not go to 0, it suffices to show that the sequence of costs (see 



Definition VIII. 2 and the remarks afterwards) is fi(n^). We prove something stronger 



but, to do this, we must first give a few more definitions. 

Definition VIII. 3. For a given Boolean function f , a witness for u is any subset W 
of the variables such that the elements of u in W determine f in the sense that for 
every u' which agrees with u on W , we have that f{uj) = f{u)'). The witness size 
of u, denoted w{u), is the size of the smallest witness for u. The expected witness 
size, denoted by w{f), is K{w{u)). 

Observe that, for any Boolean function /, the bits revealed by any algorithm A for 
/ and for any u is always a witness for u. It easily follows that the cost C{f) satisfies 
> "^(Z)- Therefore, in order to prove the proposition, it suffices to show that 



w{fn)=n{n^). 



(VIII.9) 
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Remark VIII. 3. (i). The above also implies that with a fixed uniform probability, w{u) 
is n(n2). 

(ii) . Of course when is 1, there is always a (small) witness of size (^^2") ^ ^^"^ 
the large average witness size comes from when /„ is —1. 

(iii) . However, it is not deterministically true that when is —1, w{u) is necessarily of 
size r2(r2^). For example, for u = —1 (corresponding to the empty graph), the witness 
size is o(n^) as is easily checked. Clearly the empty graph has the smallest witness size 
among u with /„ = —1. 

Lemma VIII. 10. Let En be the event that all sets of vertices of size at least .97n 
contains Ck^^z- Then lim„_>oo IP(-E'n) = 1- 

Proof. This follows, after some work, from the Janson inequalities. See |AS00j for 
details concerning these inequalities. □ 

Lemma VIII. 11. Let U be any collection of at most n^/1000 edges in Cn- Then there 
exist distinct vi,V2,vs such that no edge in U goes between any Vi and vj and 

\{e E U : e is an edge between {vi,V2,V3} and {vi,V2,V3Y}\ < n/50. (VIII. 10) 

Proof. We use the probabilistic method where we choose {f 1, f2, ^3} to be a uniformly 
chosen 3-set. It is immediate that the probability that the first condition fails is at most 



^\U\/{f) < 1/100. Letting Y be the number of ed ges in the set appearing in (^VIII.10| 
and Y' be the number of U edges touching vi, it is easy to see that 

E{Y) < 3E{Y') = 6\U\/n < n/100 

where the equality follows from the fact that, for any graph, the number of edges is 



half the total degree. By Markov's inequality, the probability of the event in (VIII. 10) 
holds with probably at least 1/2. This shows that the random 3-set {vi, V2, V3} satisfies 
the two stated conditions with positive probability and hence such a 3-set exists. □ 



By Lemma VIII. 10, we have P(A^ H En) > c > for all large n. To prove the 
theorem, it therefore suffices to show that if fl En occurs, there is no witness of size 
smaller than n^/1000. Assume U to be any set of edges of size smaller than n^/1000. 



Choose {f 1, f2, fs} from Lemma VIII. 11 By the second condition in this lemma, there 
exists a set S of size at least .97n which is disjoint from {fi, ^2, "^^3} which has no U-edge 
to {vi,V2,V3}. Since En occurs, S contains a Ck„-3, whose vertices we denote by T. 
Since there are no [/-edges between T and {f 1, ^"25 "^^3} or within {t>i, V2,V3} (by the first 



condition in Lemma VIII. 11 ) and T is the complete graph, U cannot be a witness since 



A'^ occured. □ 



The key step in the proof of Proposition VIII. 9 is (VIII. 9). This is stated without 
proof in |FKW02] : however, E. Friedgut provided us with the above proof. 
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Exercise sheet of Chapter |VIII 



Exercise VIII. 1. Compute the revealment for Majority function on 3 bits. 



Exercise VIII. 2. Use Corollary VIII.2 to sliow tliat Examples |4] and [6} Iterated 3- 
Majority function and tribes, are noise sensitive. 

Exercise VIII. 3. For transitive monotone functions, is there a relationship between 
revealment and the minimal cost over all algorithms? 



Exercise VIII. 4. Show that for transitive monotone functions. Theorem VIII. 6 yields 
the same result as Theorem IVIII.ll does for the case k = 1. 

Exercise VIII. 5. What can you say about the sequence of revealments for the Iterated 
3-Majority function? [It can be shown that the sequence of revealments decays like 
for some a but it is an open question what a is.] 

Exercise VIII. 6. You are given a sequence of Boolean functions and told that it is 
not noise sensitive using noise e„ = What, if anything, can you conclude about 

the sequence of revealments 5^? 



Exercise VIII. 7. Note that a consequence of Corollary VIII.2 and the last line in 



Remark IV. 2 is that if {/n} is a sequence of monotone functions, then, if the reveal- 
ments of {/„} go to 0, the sums of the squared influences approach 0. Show that this 
implication is false without the monotonicity assumption. 
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Chapter IX 

The spectral sample 



It turns out that it is very useful to view the Fourier coefficients of a Boolean function 
as a random subset of the input bits where the "weight" or "probability" of a subset 
is its squared Fourier coefficient. It is our understanding that it was Gil Kalai who 
suggested that thinking of the spectrum as a random set could shed some light on the 
types of questions we are looking at here. The following is the crucial definition in this 
chapter. 

1 Definition of the spectral sample 

Definition IX. 1. Given a Boolean function f : Qn ^ {il} {0, 1}, we let the 
spectral measure Q = Q/ of f be the measure on subsets {1, . . . ,n} given by 

Qf(S) 5 C {!,..., n}. 

We let S^f = y denote a subset of {1, . . . ,n} chosen according to this measure and 
call this the spectral sample. We let Q also denote the corresponding expectation 
(even when Q is not a probability measure). 

By Parseval's formula, the total mass of the so-defined spectral measure is 

E fiSf = E[f']. 

5c{l,...,n} 

This makes the following definition natural. 

Definition IX. 2. Given a Boolean function f : Qn ^ {il} or {0, 1}, we let the 
spectral probability measure F — Ff of f be the probability measure on subsets of 
{1, . . . , n} given by 
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Since P/ is just Qj up to a renormalization factor, the spectral sample ,5^f = 5^ will 
denote as well a random subset of [n] sampled according to Ff. We letKf = E denote 
its corresponding expectation. 

Remark IX. 1. 

(i) Note that if / maps into {±1}, then, by Parseval's formula, Qf = Ff while if it 
maps into {0, 1}, Q/ will be a subprobability measure. 

(ii) Observe that if (/„)„ is a sequence of non-degenerate Boolean functions into {0,1}, 
then P/„x%„. 

(iii) There is no statistical relationship between u and as they are defined on 
different probability spaces. The spectral sample will just be a convenient point 
of view in order to understand the questions we are studying. 

Some of the formulas and results we have previously derived in these notes have 
very simple formulations in terms of the spectral sample. For example, it is immediate 
to check that (IV. 2) simply becomes 

E[/M/K)] = %[(!- 6)1^1] (1X.1) 

or 

E[fiu)fico,)] - E[/H]2 = %[(! - e)l^lj^^0]. (IX.2) 

Next, in terms of the spectral sample. Propositions |IV.1| and |IV.2 simply become 
the following proposition. 

Proposition IX. 1. // {/„} is a sequence of Boolean functions mapping into {±1}, 
then we have the following. 

1- {/n} is noise sensitive if and only ifl^f^l — )■ oo in probability on the set {\^f„\ 7^ 0}. 
^- {fn} is noise stable if and only if the random variables are tight. 

There is also a nice relationship between the pivotal set V and the spectral sample. 
The following result, which is simply Proposition IV. 3 (see also the remark after this 
proposition), tells us that the two random sets V and ^ have the same 1-dimensional 
marginals. 

Proposition IX.2. If f is a Boolean function mapping into {±1}, then for all i G [n] 
we have that 

p(i ev) = Qii E y) 

and hence E(|P|) = Q(|^|). 

(This proposition is stated with Q instead of P since if / maps into {0, 1} instead, 
then the reader can check that the above holds with an extra factor of 4 on the right 
hand side while if P were used instead, then this would not be true for any constant.) 
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Even though ^ and V have the same "1-dimensional" marginals, it is not however 
true that these two random sets have the same distribution. For example, it is easily 
checked that for MAJ3, these two distributions are different. Interestingly, as we will 
see in the next section, ^ and V also always have the same "2-dimensional" marginals. 
This will prove useful when applying second moment method arguments. 



Before ending this section, let us give an alternative proof of Proposition |VI.9| using 
this point of view of thinking of random set. 



Alternative proof of Proposition VI. 9 The statement of the proposition when 



converted to the spectrum states (see the exercises in this chapter if this is not clear) 
that for any a„ — )■ 00, 

lim P(|^„| > anu'^a^in)) = 0. 



However this immediately follows from Markov's inequality using Propositions VI.8 and 



2 A way to sample the spectral sample in a sub- 
domain 

In this section, we describe a method of "sampling" the spectral measure restricted to 
a subset of the bits. As an application of this, we show that and V in fact have the 
same 2-dimensional marginals, namely that for all i and j, eV) = Q{i,j G =5^). 

In order to first get a little intuition about the spectral measure, we start with an 
easy proposition. 

Proposition IX. 3 ( |GPS10 ]). For a Boolean function f and A C {1,2, ...,n}, we 
have 

Q(^;CA)=E[|E(/|A)p] 
where conditioning on A means conditioning on the bits in A. 

Proof. Noting that E(x5'|y4) is xs S A and otherwise, we obtain by expanding 
that 

Eif\A) = J2hS)xs. 

SCA 

Now apply Parseval's formula. □ 

If we have a subset A C {1,2,. ..,n}, how do we "sample" from A fl A nice 
way to proceed is as follows: choose a random configuration outside of A, then look at 
the induced function on A and sample from the induced function's spectral measure. 
The following proposition justifies in precise terms this way of sampling. Its proof is 



just an extension of the proof of Proposition IX. 3 
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Proposition IX. 4 ( |GPS10j ). Fix a Boolean function f on For A C {1, 2, . . . , n} 

and y G {±1}'^'', that is a configuration on A'^ , let gy be the function defined on {±1}"^ 
obtained by using f but fixing the configuration to be y outside of A. Then for any 
S A, we have 

nA = S)=E[Q{y,^ = S)]=E[gl{S)]. 



Proof. Using the first line of the proof of Proposition IX.3 , it is easy to check that for 
any S C A, we have that 



S'CA<= 



This gives 



E 



nA = s] 



S'CA<= 



which is precisely the claim. 



□ 



Remark IX.2. Observe that Proposition |IX.3| is a special case of Proposition |IX.4| when 
5* is taken to be and A is replaced by A'^. 

The following corollary was first observed by Gil Kalai. 

Corollary IX. 5 ( |GPS10j ). If f is a Boolean function mapping into {±1}, then for 
all i and j , 



(The comment immediately following Proposition IX.2 holds here as well.) 

Proof. Although it has already been established that V and ^ have the same 1- 
dimensional marginals, we first show how Proposition |IX.4 can be used to establish 
this. This latter proposition yields, with A = S = {i}, that 



Q(z ey) = Q{y n {z} = {z}) = E[gli{t})]. 

Note that gy is ±Ui if i is pivotal and constant if i is not pivotal. Hence the last term 
is F{i e V). 

For the 2-dimensional marginals, one first checks this by hand when n = 2. For 
general n, taking A = S = in Proposition IX. 4, we have 

Q(z, J G ^) = n K j} = {^, j}) = E[^J({z,j})]. 

For fixed y, the n = 2 case tells us that gy{{i,j}) = F{i,j G Vgy). Finally, a little 
thought shows that E[P(i, j G Vg^)] = f{i,j G V), completing the proof. □ 
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3 Nontrivial spectrum near the upper bound for 
percolation 

We now return to our central event of percolation crossings of the rectangle Rn where 
fn denotes this event. At this point, we know that for Z^, (most of) the spectrum lies 
between (for some eo > 0) and 0:4(71) while for T it sits between n^/^+"W ^^d 
^3/4+0(1) _ ^Yiis section, we show that there is a nontrivial amount of spectrum near 
the upper bound 71^04 (n). For T, in terms of quantitative noise sensitivity, this tells us 
that if our noise sequence e„ is equal to l/n^^'^~^ for fixed 5 > 0, then in the limit, the 
two variables f{u) and f{uj^^) are not perfectly correlated; i.e., there is some degree 
of independence. (See the exercises for understanding such arguments.) However, we 
cannot conclude that there is full independence since we don't know that "all" of the 
spectrum is near (yet!). 

Theorem IX. 6 ( |GPS10j ). Consider our percolation crossing functions {fn} (with 
values into {iljy' of the rectangles Rn for 7? orT. There exists c > such that for all 
n, 



The key lemma for proving this is the following second moment bound on the number 
of pivotals which we prove afterwards. It has a similar flavor to Exercise 6 in Chapter 

ED 

Lemma IX. 7 ( ^GPSlOj ). Consider our percolation crossing functions {fn} above and 
let R'^ be the box concentric with Rn with half the radius. If Xn = |P„ H R'J is the 
cardinality of the set of pivotal points in R'^, then there exists a constant C such that 
for all n we have that 

E\\Xn\'] < CE\\Xn\?. 



Proof of Theorem IX. 6| , Since Vn and have the same 1 and 2-dimensional 



marginals, it follows fairly straightforward from Lemma IX. 7 that we also have that for 
all n 



p[|^nnKI'] <CF[\ynnR'j]\ 

Recall now the Paley-Zygmund inequality which states that if Z > 0, then for all 
(0,1), 

FiZ>eE[Z])>{l-9f^^. (IX.3) 
The two above inequalities (with Z = \S^n H -R^| and = 1/2) imply that for all n, 

-^n En^„n<|l, 1 
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Now, by Proposition IX. 2, one has that ]E[|t5^„ fl -R^^l] = E[X„]. Furthermore (a 
trivial modification of) Proposition VI. 8 yields ]E[X„] x n'^a^ln) which thus completes 
the proof. □ 

We are now left with 



we 



Proof of Lemma IX. 7[ As indicated at the end of the proof of Theorem IX. 6 
have that ]E(X„) x n'^a/i{n). Next, for x,y E R'^, a picture shows that 

P(x, yeVn)< al{\x - y\/2)ai{2\x - y\,n/2) 



since we need to have the 4-arm event around x to distance |x — y|/2, the same for y, and 
the 4-arm event in the annulus centered at {x + y)/2 from distance 2\x — y\ to distance 
n/2 and finally these three events are independent. This is by quasi-multiplicity at 
most 

0{l)al{n)/a4{\x - y\,n) 



and hence 



E[\X^\']<0{l)al{n)J2 



1 



x,y 



ai{\x - yln)' 



Since, for a given x, there are at most 0(1)2^^ ?/'s with \x — y\ E [2'^, 2''""'""'^], using 
quasi-multiplicity, the above sum is at most 



log2(n) 



0{l)n^al{n) 



■)2k 



k=0 



04(2^, n) 



Using 



< (R/r 



,2-e 



0:4 (r, R) 

(this is the fact that the four-arm exponent is strictly less than 2), the sum becomes at 



most 



log2(") 



0{l)n^-'al{n) ^ 2'=^ 



k=0 



Since the last sum is at most 0{l)n'^, we are done. 



□ 



In terms of the consequences for quantitative noise sensitivity. Theorem IX. 6 implies 
the following corollary; see the exercises for similar implications. We state this only for 
the triangular lattice. An analogous result holds for Z^. 

Corollary IX. 8. For T, there exists c> so that if = 1/(72^04(71)), then for all n, 

nUuj) ^ /„(u;,J) > c. 
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Note, importantly, this does not say that /^(a;) and fn{uje„) become asymptotically 
uncorrelated, only that they are not asymptotically completely correlated. To ensure 
that they are asymptotically uncorrelated is significantly more difficult and requires 
showing that "all" of the spectrum is near n^^^. This much more difficult task is the 
subject of the next chapter. 



CHAPTER IX. THE SPECTRAL SAMPLE 



Exercise sheet on chapter IIX 



Exercise IX. 1. Let {/n} be an arbitrary sequence of Boolean functions mapping into 
{±1} with corresponding spectral samples {^n}- 

(i) . Show that P[0 < < A„] ^ implies that E[(l - e„)l-^"l/^„^0] ^ if e„v4„ ^ 

CO. 

(ii) . Show that E[(1 - e„)l-^"lj^ ^0] ^ implies that P[0 < < A„] if e„/l„ = 
0(1). 

Exercise IX. 2. Let {fn} be an arbitrary sequence of Boolean functions mapping into 
{±1} with corresponding spectral samples {^n}- 

(i) . Show that F[f{u) ^ /(wej] ^ and A„e„ = imply that P[|^„| > A„] ^ 0. 

(ii) . Show that P[|^„| > A„] ^ and A„e„ = o(l) imply that P[/(w) 7^ /(w^J] ^ 0. 



Exercise IX. 3. Prove Corollary IX. 8 



Exercise IX. 4. For the iterated 3-Majority sequence, recall that the total influence is 
n" where a = 1 — log 2/ log 3. Show that for e„ = l/n", P(/„(ci;) 7^ fn{<^t,^)) does not 
tend to 0. 

Exercise IX. 5. Assume that {/«} is a sequence of monotone Boolean functions on n 
bits with total influence equal to n^/^ up to constants. Show that the sequence cannot 
be noise sensitive. Is it necessarily noise stable as the Majority function is? 

Exercise IX. 6. Assume that {/«} is a sequence of monotone Boolean functions with 
mean on n bits. Show that one cannot have noise sensitivity when using noise level 
en = l/ni/2. 

Exercise IX. 7. Show that V and y have the same 2-dimensional marginals using only 
Proposition IX. 3 rather than Proposition IX. 4[ 

Hint: It suffices to show that P({i, j} n P = 0) = Q({i, j} n ^ = 0). 



Exercise IX. 8. (Challenging problem) Do you expect that exercise IX. 5 is sharp, 
meaning that, if 1/2 is replaced by a < 1/2, then one can find noise sensitive examples? 
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Chapter X 

Sharp noise sensitivity of 
percolation 



We will explain in this chapter the main ideas of the proof in |GPS10] that most of 
the "spectral mass" lies near n^a^^n) ^ 71,3/4+0(1) _ This proof being rather long and 
involved, the content of this chapter will be far from a formal proof. Rather it should 
be considered as a (hopefully convincing) heuristic explanation of the main results, and 
possibly for the interested readers as a "reading guide" for the paper |GPS10] . 

Very briefly speaking, the idea behind the proof is to identify properties of the geom- 
etry of t5^/„ which are reminiscent of a self-similar fractal structure. Ideally, S^f^ would 
behave like a spatial branching tree (or in other words a fractal percolation process), 
where distinct branches evolve independently of each other. This is conjecturally the 
case, but it turns out that it is very hard to control the dependency structure within 
yf„. In |GPS10] . only a tiny hint of spatial independence within S^f^ is proved. One 
of the main difficulties of the proof is to overcome the fact that one has very little 
independence to play with. 

A substantial part of this chapter focuses on the much simpler case of fractal per- 
colation. Indeed, this process can be seen as the simplest toy model for the spectral 
sample ^f„- Explaining the simplified proof adapted to this setting already enables us 
to convey some of the main ideas for handling ^f„- 



1 State of the art and main statement 



See Figure X.l where we summarize what we have learned so far about the spectral 
sample S^f^ of a left to right crossing event /„. 

From this table, we see that the main question now is to prove that all the spectral 
mass indeed diverges at speed n'^a4,{n) which is n^/^+°W for the triangular lattice. This 
is the content of the following theorem. 
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on the square lattice on the triangular lattice T 



The spectral 
mass diverges at 
polynomial 
speed 



There is a positive ex- 
ponent e > 0, s.t. 

P[0 < l^/J <n']^0 



The same holds for all e < 
1/8 



Lower tail esti- 
mates 



On both lattices, Theorem VIII. 1 enables to obtain 



(non-sharp) lower tail estimates 



A positive 
fraction of the 
spectral mass 
lies "where it 
should" 



There is some universal c > 
s.t. 

^[\^fJ > cn'^a^in)] > c 



> cn 



3/4- 



-o(l)] > 



May be summa- 
rized by the fol- 
lowing picture 



A smaller 
"bump" of pos- 
itive Spectral 
mass "^^.^ 






of 



At leas' 
a post: 
fraction 
the spec 
tral mass 
lies here 



.3/4+0(1) 



Figure X.l: A summary of some of the results obtained so far for 
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Theorem X.l (jGPSTO]). 

limsupP[0 < l^/J < Xn^a^in)] — > . 

n— >oo A— s>0 

On the triangular lattice T, the rate of decay in A is known explicitly. Namely: 
Theorem X.2 ( jGPSlO j). On the triangular grid T, the lower tail of \-5^f,\ satisfies 

hmsupPfO < lYfJ < AE[|^/J])] x X'/' . 

This result deals with what one might call the "macroscopic" lower tail, i.e. with 
quantities which asymptotically are still of order ]E[|t^/„|] (since A remains fixed in 
the above limsup). It turns out that in our later study of dynamical percolation in 
Chapter XI, we will need a sharp control on the full lower tail. This is the content of 
the following stronger theorem: 

Theorem X.3 ( |GPS10] ). On 1? and on the triangular grid T, for all 1 < r < n, one 

has 

On the triangular grid, this translates into 

P[0 < \yfj\ <u\ n~^u^ , 
where we write ~ to avoid relying on o(l) terms in the exponents. 



2 Overall strategy 

In the above theorems, it is clear that we are mostly interested in the cardinality of 
^f^. However, our strategy will consist in understanding as much as we can about the 
typical geometry of the random set sampled according to the spectral probability 
measure P/„ . 

As we have seen so far, the random set S^f^ shares many properties with the set of 
pivotal points Vj^. A first possibility would be that they are asymptotically similar. 
After all, noise sensitivity is intimately related with pivotal points, so it is not unrea- 
sonable to hope for such a behavior. This scenario would be very convenient for us 
since the geometry of Vf„ is now well understood (at least on T) thanks to the SLE 
processes. In particular, in the case of Vf^, one can "explore" P/,^ in a Markovian way 
by relying on exploration processes. Unfortunately, based on very convincing heuristics, 
it is conjectured that the scaling limits of and ^Vj^ are singular random compact 



sets of the square. See Figure pO] for a quick overview of the similarities and differences 
between these two random sets. 
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Pivotal set 






Spectral set ^f^^ 


First moment 






Second moment 






E[I^/,.P] 


Higher moments 
{k > 3) 


In general, they differ ! 


Methods for sam- 
phng these ran- 
dom sets 


Easy (and fast) using two explo- 
ration paths: 


The spectral sample S^j^^ is 
much harder to sample. 
In fact, the only known way 
to proceed is to compute the 
weights fn{SY, one at a time 














Spatial correla- 
tion stiuctuie 


Distant regions in Vf^ behave 
more or less independently of 
each other. 

Furthermore, one can use the 
very convenient spatial Markov 
property due to the i.i.d struc- 
ture of the percolation picture. 


? 

Much of the picture here re- 
mains unclear 


Lower Tail be- 
havior 




1]> 


: n-T5 P[l^/J = 1] - ^"^ 



Figure X.2: Similarities and differences between S^f^ and Vf„. 



The conclusion of this table is that they indeed share many properties, but one 
cannot deduce lower tail estimates on out of lower tail estimates on \Vf„\. Also, 

even worse, we will not be allowed to rely on spatial Markov properties for 

However, even though Vf^ and S^f^ differ in many ways, they share at least one essen- 
tial property: a seemingly self-similar fractal behavior. The main strategy in |GPS10j 
to control the lower-tail behavior of |=^/„| is to prove that in some very weak sense, 
behaves like the simplest model among self-similar fractal processes in [0,n]^: i.e. 



3. TOY MODEL: THE CASE OF FRACTAL PERCOLATION 



111 



a super-critical spatial Galton- Watson tree embedded in [0,n]^, also called a fractal 
percolation process. The lower tail of this very simple toy model will be investigated in 
detail in the next section with a technique which will be suitable for The main 

difficulty which arises in this program is the lack of knowledge of the independency 
structure within ^f„- In other words, when we try to compare S^f^ with a fractal per- 
colation process, the self-similarity already requires some work, but the hardest part is 
to deal with the fact that distinct "branches" (or rather their analogues) are not known 
to behave even slightly independently of each other. We will discuss these issues in 
Section |4] but will not give a complete proof. 



3 Toy model: the case of fractal percolation 

As we explained above, our main strategy is to exploit the fact that has a certain 
self-similar fractal structure. Along this section, we will consider the simplest case 
of such a self-similar fractal object: namely fractal percolation, and we will detail in 
this simple setting what our later strategy will be. Deliberately, this strategy will not 
be optimal in this simplified case. In particular, we will not rely on the martingale 
techniques that one can use with fractal percolation or Galton- Watson trees, since such 
methods would not be available for our spectral sample 

3.1 Definition of the model and first properties 

To make the analogy with S^f^ easier let 

n:=2^ ,h>l, 

and let's fix a parameter p G (0, 1). 

Now, fractal percolation on [0,n]^ is defined inductively as follows: divide [0,2'^]^ 
into 4 squares and retain each of them independently with probability p. Let be the 
union of the retained 2'^~ ^-squares. The second- level tree is obtained by reiterating 
the same procedure independently for each 2^*" ^-square in T^. Continuing in the same 
fashion all the way to the squares of unit size, one obtains Tn = T := which is a 
random subset of [0,n]^. See |LyPll| for more on the definition of fractal percolation. 



See also Figure |X.3| for an example of T . 

Remark X.l. We thus introduced two different notations for the same random set 
{%i=2^ = T^)- The reason for this is that on the one hand the notation Tn defined on 
[0,?7,]^ = [0,2'*]^ makes the analogy with 5^f^ (also defined on [0,r;,]^) easier, while on 
the other hand inductive proofs will be more convenient with the notation T^. 

In order to have a supercritical Galton- Watson tree, one has to choose p G (1/4, 1). 
Furthermore, one can easily check the following easy proposition. 



Proposition X.4. Let p e (1/4,1). Then 
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Figure X.3: A realization of a fractal percolation = 



E[\Tn\] = = 



and 



mXl'] < 0{l)E[\Tn\]- 



In particular, by the second moment method (e.g. the Paley-Zygmund inequality), 
with positive probability, Tn is of order n^+'°g2P. 



Let 



a := 2 + \0g2P ■ 



This parameter a corresponds to the "fractal dimension" of 7^. To make the analogy 
with even clearer, one could choose p in such a way that a = 2 + logaP = 3/4, but 
we will not need to. 

The above proposition implies that on the event 7^ 7^ 0, with positive conditional 



probability |7^| is large (of order n°). This is the exact analogue of Theorem IX. 6 for 
the spectral sample 



Let us first analyze what would be the analogue of Theorem X.l in the case of our 
toy model Tn- We have the following. 
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Proposition X.5. 



limsupP[0 < ITnl < An°)] — > 

r).— ^nn A— >-0 



Remark X.2. If one could rely on martingale techniques, then this proposition is a 
corollary of standard results. Indeed, as is well-known 



Mi 



is a positive martingale. Therefore it converges, as — )■ oo, to a non- negative random 
variable W > 0. Furthermore, the conditions of the Kesten-Stigum Theorem are ful- 
filled (see for example Section 5.1 in |LyPll| ) and therefore W is positive on the event 
that there is no extinction. This implies the above proposition. 

As we claimed above, we will intentionally follow a more hands-on approach in this 
section which will be more suitable to the random set S^f^ which we have in mind. 
Furthermore this approach will have the great advantage to provide the following much 



more precise result, which is the analogue of Theorem X.3 for 7^. 
Proposition X.6. For any 1 < r < n, 

P[0 < ITnl <r"] - r-Vog2i/M 



S2 - 

n 



where fi is an explicit constant in (0, 1) computed in Exercise X.2 



3.2 Strategy and heuristics 

Letting m <^ ra", we wish to estimate P[0 < |7^| < m]. Even though we are only in- 
terested in the size of 7^, we will try to estimate this quantity by understanding the 
geometry of the conditional set: 



' n 



\u 



< 17^1 < M 



The first natural question to ask is whether this conditional random set is typically 



localized or not. See Figure X.4 



Intuitively, it is quite clear that the set Tn conditioned to be very small will tend to 



be localized. So it is the picture on the right in Figure X.4 which is more likely. This 
would deserve a proof of course, but we will come back to this later. The fact that it 
should look more and more localized tells us that as one shrinks m, this should make 
our conditional 7^" more and more singular with respect to the unconditional one. But 
how much localization should we see? This is again fairly easy to answer, at least on the 
intuitive level. Indeed, 7^" should tend to localize until it reaches a certain mesoscopic 
scale r such that 1 <^ r <^ n. One can compute how much it costs to maintain a single 
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How does 

c(ra o< 

\Tn \ < u] look ? 





m 

< u 








< u 










E3 






< u 



More Entropy ( in Vol^) 
but costs more to maintain 
these 3 "islands" alive. 



OR? 




Less Entropy ( in Vol^) but 
only one island to maintain 
alive. 



Figure X.4: Entropy v.s. Clustering effect 

branch (or 0(1) branches) alive until scale r, but once this is achieved, one should let 
the system evolve in a "natural" way. In particular, once the tree survives all the way 
to a mesoscopic square of size r, it will (by the second moment method) produce f2(r") 
leaves there with positive probability. 

To summarize, typically will maintain 0(1) many branches alive at scale 1 ^ 
r <^ n, and then it will let the branching structure evolve in a basically unconditional 
way. The intermediate scale r is chosen so that r'^ x u. 

Definition X.l. If \ < r < n = 2^ is such that r = 2'',0 < I < h, let T(r) denote the 
set of branches that were still alive at scale r = 2' m the iterative construction of Tn- 
In other words, T{r) = T^~^ and Tn C [JT(r)- This random set T{r) will he the analogue 



of the "r- smoothing" ,y^r) of the spectral sample defined later in Definition X.2 



Returning to our problem, the above heuristics say that one expects to have for any 
1 < u < n°. 

p[o < |r„| < m] X p[o < \r(r)\ < 0(1)] 
^^[\rir)\ = i], 

where r is a dyadic integer chosen such that r° x u. Or in other words, we expect that 



P[0< |7;.| <r"] xP[|7^,)| = l] 



(X.1) 



In the next subsection, we briefly explain how this heuristic can be implemented 
into a proof in the case of the tree 7^ in a way which will be suitable to the study of 
^f„- We will only skim through the main ideas for this tree case. 
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3.3 Setup of a proof for Tn 

Motivated by the above heuristics, we divide our system into two scales: above and 
below the mesoscopic scale r. One can write the lower tail event as follows (let 1 ^ 
r <^ n): 



P[0 < |r„| < r"] = ^ P[|7^,)| = k] P[0 < |r„| < r° I |7^,)| = k] . (X.2) 



k>l 



It is not hard to estimate the second term P[0 < |7^| < | |7{r)| = k'j. Indeed, in 
this term we are conditioning on having exactly k branches alive at scale r. Indepen- 
dently of where they are, "below" r, these k branches evolve independently of each 
other. Furthermore, by the second moment method, there is a universal constant c > 
such that each of them exceeds the fatal amount of r'^ leaves with probability at least 
c (note that in the opposite direction, each branch could also go extinct with positive 
probability). This implies that 

P[0< ITnl <r° I |7^,)| = k]< (l-c)^ 

Remark X.3. Note that one makes heavy use of the independence structure within Tn 
here. This aspect is much more nontrivial for the spectral sample ^f„- Fortunately it 
turns out, and this is a key fact, that in |GPS10j one can prove a weak independence 
statement which in some sense makes it possible to follow this route. 

We are left with the following upper bound: 

P[0 < ITnl < r"] < 5^ P [17^0 1 = k]il- c)'^ . (X.3) 

k>l 



In order to prove our goal of (X.l), by exploiting the exponential decay given by 
(1 — c)'^ (which followed from independence), it is enough to prove the following bound 
on the mesoscopic behavior of T: 

Lemma X.7. There is a sub-exponential function k i-)- g{k) such that for all 1 < 
r < n, 

F[\r^r)\ = k]<g{k)F[\r(^r)\ = l]. 



Notice as we did in Definition X.l that since 7(r) has the same law as T'^~', this is 
a purely Galton- Watson tree type of question. 

The big advantage of our strategy so far is that initially we were looking for a sharp 
control on P[0 < |7^,| < m] and now, using this "two-scales" argument, it only remains 
to prove a crude upper bound on the lower tail of |7(r-)|. By scale invariance this is 
nothing else than obtaining a crude upper bound on the lower tail of \%i\- Hence this 
division into two scales greatly simplified our task. 
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3.4 Sub-exponential estimate on the lower-tail (Lemma X.7) 



The first step towards proving and understanding Lemma X.7| is to understand the 
term P[|7(r-)| = l]- From now on, it will be easier to work with the "dyadic" notations 
instead, i.e. with 7~* = (see remark X.l). With these notations, the first step is 



equivalent to understanding the probabilities Pi := P[|T*| = Ij. This aspect of the 
problem is very specific to the case of Galton- Watson trees and gives very little insight 
into the later study of the spectrum . Therefore we postpone the details to Exercise 



X.2 The conclusion of this (straightforward) exercise is that pi behaves as i — )■ oo like 



Pi ~ C/i 



for an explicit exponent n G (0, 1) (see Exercise X.2). In particular, in order to prove 



Proposition X.6, it is now enough to find a sub-exponential function k g(^k) such 
that for any i,k > 1, 



F[\r\ = k] <g{k)fi\ 
More precisely, we will prove the following lemma. 



(X.4) 



Lemma X.8. Let g{k) := 2^i°si('=+2), where 9 is a fixed constant to be chosen later. 
Then for all i,k > 1, one has 



F[\r\ = k] <g{k)^i\ 



(X.5) 



We provide the proof of this lemma here, since it can be seen as a "toy proof" of the 
corresponding sub-exponential estimate needed for the r-smoothed spectral samples 



stated in the coming Theorem X.13, The proof of this latter theorem shares 



some similarities with the proof below but is much more technical since in the case of 
y{r) one has to deal with a more complex structure than the branching structure of a 
Galton- Watson tree. 

Proof. We proceed by double induction. Let k > 2 he fixed and assume that equa- 



tion (X.5) is already satisfied for all pair {i',k') such that k' < k. Based on this 



assumption, let us prove by induction on i that all pairs (i, k) satisfy equation (X.5) as 
well. 

First of all, if i is small enough, this is obvious by the definition of g{k). Let 

J = Jk ■■= sup{i > 1 : g{k)if > 10} . 



Then, it is clear that equation (X.5) is satisfied for all {i,k) with i < Jj^. Now let 
i > Jfc. 

If T* is such that |T*| = A; > 1, let L = L(J~^) > be the largest integer such 
that T* intersects only one square of size 2^^^. This means that below scale 2^~^, the 
tree T* splits into at least 2 live branches in distinct dyadic squares of size 2*~^~^. Let 
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d G {2, 3,4} be the number of such hve branches. By decomposing on the value of L, 
and using the above assumption, we get 



f>[\r\ = k\ <¥[L{r)>i-Jk\+ 



where q is the probabihty that our Galton- Watson tree goes extinct. 

Let us first estimate what P[L(T*) > m] is for m > 0. If m > 1, this means that 
among the 2^*" dyadic squares of size 2*""^, only one will remain alive all the way to 
scale 1. Yet, it might be that some other such squares are still alive at scale 2*""^ but 
will go extinct by the time they reach scale 1. Let Pm,b be the probability that the 
process 7"'""'"^ which lives in [0,2™+^]^, is entirely contained in a dyadic square of size 
2^. With such notations, one has 

F[L{V)>m] =Pm,i-n.- 

Furthermore, if i = m, one has = Pi ^ c/i*. It is not hard to prove (see Exercise 



X.2) the following lemma. 



Lemma X.9. For any value ofm,b> 0, one has 

Pm,b < IJ'"' ■ 

In particular, one has a universal upper bound in b > 0. 

It follows from the lemma that F[L{V) = l] < f[L{V) > /] < /i' and 

F[L{r)>i-Jk] <fi'~'' (X.6) 
< — g{k) /i* by the definition of Jk ■ (X.7) 

This gives us that for some constant C 

k j ^ 1 ^ ^ ^ k j —— k 

[kj)i<j<d 

k j 1 ; ^ ^ k j — k 
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Let us deal with the d = 2 sum (the contributions coming from d > 2 being even 
smaller). By concavity of /c i— 6\ogl{k + 2), one obtains that for any (fci, ^2) such that 
^1 + ^2 = k: g{ki)g{k2) < g{k/2Y. Since there are at most /c^ such pairs, this gives us 
the following bound on the d = 2 sum. 



1=0 



E l[g{k,)<J2f^"'k'9{k/2f 

{kj)i<j<2 ^ 



«=0 



< 



1 



< 10- 



/i 
1 



1-/X 



/x-^'^ k'g{k/2y 

k'gik/2nfxgik)r' 



by definition of Jk- 

Now, some easy analysis implies that if one chooses the constant > large enough, 
then for any k > 2, one has ClOj^k"^ g{k/2)'^ {fig{k)y^ < j^gik). Altogether (and 
taking into consideration the d > 2 contributions), this implies that 



F\\V 



k] < -g{k)fi' < g{k)fi' 



as desired. 



□ 



Remark X.4. Recall the initial question from Figure X.4 which asked whether the clus- 
tering effect wins over the entropy effect or not. This question enabled us to motivate 
the setup of the proof but in the end, we did not specifically address it. Notice that 



the above proof in fact solves the problem (see Exercise X.3). 



4 Back to the spectrum: an exposition of the proof 



4.1 Heuristic explanation 

Let us now apply the strategy we developed for Tn to the case of the spectral sample . 



Our goal is to prove Theorem X.3 (of which Theorems X.l and X.2 are straightforward 
corollaries). Let S^f^ C [0,r;,]^ be our spectral sample. We have seen (Theorem IX.6) 
that with positive probability x n^a4(n). For all 1 < m < n'^a^i^n), we wish to 

understand the probability P[0 < < u\. Following the notations we used for Tn, 

let S^j^ be the spectral sample conditioned on the event {0 < < u}. 

Question: How does typically look? 

To answer this question, one has to understand whether tends to be localized 



or not. Recall from Figure X.4 the illustration of the competition between entropy and 



4. BACK TO THE SPECTRUM: AN EXPOSITION OF THE PROOF 



119 



clustering effects in the case of Tn- The same figure apphes to the spectral sample 



We will later state a clustering lemma (Lemma X.14) which will strongly support 



the localized behavior described in the next proposition. 

Therefore we are guessing that our conditional set yj^ will tend to localize into 0(1) 
many squares of a certain scale r and will have a "normal" size within these r-squares. 
It remains to understand what this mesoscopic scale function of u is. 

By "scale invariance" , one expects that if is conditioned to live in a square of 
size r, then |=5^/„| will be of order r'^a^lr) with positive conditional probability. More 



precisely, the following lemma will be proved in Problem X.6 



Lemma X.IO. There is a universal c G (0, 1) such that for any n and for any r -square 
B C [n/4,3n/4]2 m the "hulk" of [Q,nf, one has 

H^Pj-, e (c, 1/c) I ^ and yf^cB]>c. (X.8) 



In fact this lemma holds uniformly in the position of the r-square B inside [0, n 
but we will not discuss this here. 



2 



What this lemma tells us is that for any 1 < u < n'^a4{n), if one chooses r = r„ in 
such a way that r^a^lr) x u, then we expect to have the following estimate: 

P[0 < \yfj < m] X P[^/„ intersects 0(1) r-squares in [0,n]^] 
X P[^/„ intersects a single r-square in [0,n]^] 

At this point, let us introduce a concept which will be very helpful in what follows. 

Definition X.2 ("r-smoothing"). Let 1 < r < n. Consider the domain [0,n]^ and 
divide it into a grid of squares of edge-length r. (If 1 ^ r ^ n, one can view this grid 
as a mesoscopic grid). 

Ifn is not divisible by r, write n = mr + q and consider the grid of r-squares covering 
[0, (m+l)r]2. 

Now, for each subset S C [0,n]^, define S(^r) to be the set of r x r squares in the 
above grid which intersect S. In particular \S(^r)\ will correspond to the number of such 
r-squares which intersect S . With a slight abuse of notation, S(^r) will sometimes also 
denote the actual subset of [0,ri]^ consisting of the union of these r-squares. 

One can view the application S t— )■ S(^r) an r-smoothing since all the details 
below the scale r are lost. 



Remark X.5. Note that in Definition X.l we relied on a slightly different notion of 
"r-smoothing" since in that case, T(r) could also include r-branches which might go 
extinct by the time they reached scale one. The advantage of this choice was that there 
was an exact scale-invariance from T to T{r) while in the case of , there is no such 
exact scale-invariance from y to y^r)- 
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With these notations, the above discussion leads us to beheve that the following 
proposition should hold. 

Proposition X.ll. For all 1 < r < n, one has 

P[0 < l^fj < r'a,{r)] x [|^(,)| = l] . 
Before explaining the setup used in |GPS10] to prove such a result, let us check that 



it indeed implies Theorem X.3 By neglecting the boundary issues, one has 



P/. 0^(OI = 1] ^ E H-^f- ^ and C B\ . (X.9) 



r-squares 
B C [n/4,3n/4]^ 



There are 

0{S) such 5 squares, and for each of these, one can check (see Exercise X.5| 

that 

P ^ and C S] X «4(r, nf . (X.IO) 



Therefore, Proposition X.ll| indeed implies Theorem X.3 



4.2 Setup and organization of the proof of Proposition X.ll 



To start with, assume we knew that disjoint regions in the spectral sample 5^^^ behave 
more or less independently of each other in the following (vague) sense. For any k > 
1 and any mesoscopic scale 1 < r < n, if one conditions on to be equal to 
Bi U ■ ■ ■ U Bk for k disjoint r-squares, then the conditional law of ^\\jb^ should be 
"similar" to an independent product of C[y\Bi | H i?j 7^ 0] , i G {1, . . . , k}. Similarly 
as in the tree case (where the analogous property for Tn was an exact independence 
factorization), and assuming that the above comparison with an independent product 
could be made quantitative, this would potentially imply the following upper bound for 
a certain absolute constant c > 0: 

F[0 <\yf J Kr^a^ir)] < J] P[|^(,)| = A;] (1 - c)^ (X.ll) 

fc>i 

This means that even if one managed to obtain a good control on the dependency 
structure within S^f^ (in the above sense), one wo uld sti ll need to have a good estimate 



on P[|t5^(r)| = /c] in order to deduce Proposition X.ll, This part of the program is 
achieved in |GPS10j without requiring any information on the dependency structure of 
S^f^. More precisely, the following result is proved: 

Theorem X.12 ( |GPS10j ). There is a sub- exponential function g i— )■ g{k), such that 
for any 1 < r < n and any k > 1, 

F[\yir)\ = k] < g{k)F[\y(^r)\ = l]. 
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The proof of this result will be described briefly in the next subsection. 



One can now describe how the proof of Theorem X.3 is organized in |GPS10j . It is 
divided into three main parts: 

1. The first part deals with proving the multi-scale sub-exponential bound on the 



lower-tail of \^{r)\ given by Theorem X.12 



The second part consists in proving as much as we can on the dependency struc- 
ture of ^f„- Unfortunately here, it seems to be very challenging to achieve a good 
understanding of all the "independence" that should be present within ^f^. The 
only hint of independence which was finally proved in |GPS10j is a very weak one 



(X.ll) 



see subsection 4.4). In particular, it is too weak to readily imply a bound like 



Since disjoint regions of the spectral sample are not known to behave inde- 
pendently of each other, the third part of the proof consists in adapting the setup 
we used for the tree (where distinct branches evolve exactly independently of each 
other) into a setup where the weak hint of independence obtained in the second 



part of the program turns out to be enough to imply the bound given by (X.ll ) 
for an appropriate absolute constant c > 0. This final part of the proof will be 
discussed in subsection 14.51 

The next three subsections will be devoted to each of these 3 parts of the program. 



4.3 Some words about the sub-exponential bound on the lower 
tail of c5^(j.) 

In this subsection, we turn our attention to the proof of the first part of the program. 



i.e. on Theorem X.12 In fact, as in the case of Tn, the following more explicit statement 



is proved in |GPS10j . 

Theorem X.13 ( |GPS10] ). There exists an absolute constant 6 > such that for any 
1 < r < n and any k > 1, 

P[|^(,)| = A;] < 2^'°s'('^+2) P[|^(,)| = 1] . 

Remark X.6. Note that the theorems from |BKS99j on the noise sensitivity of percola- 
tion are all particular cases (r = 1) of this intermediate result in jGPSlOj . 

The main idea in the proof of this theorem is in some sense to assign a tree structure 
to each possible set S^[r)- The advantage of working with a tree structure is that it is eas- 
ier to work with inductive arguments. In fact, once a mapping ^ "tree structure" 
has been designed, the proof proceeds similarly as in the case of 7(r) by double induction 
on the depth of the tree as well as on /c > 1. Of course, this mapping is a delicate affair: 
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it has to be designed in an "efficient" way so that it can compete against entropy effects 
caused by the exponential growth of the number of tree structures. 

We will not give the details of how to define such a mapping, but let us describe 
informally how it works. More specifically than a tree structure, we will in fact assign 
an annulus structure to each set S^(t)- 

Definition X.3. Let A he a finite collection of disjoint (topological) annuli in the plane. 
We call this an annulus structure. Furthermore, we will say that a set S G M."^ is 
compatible with A (or vice versa) if it is contained m \ and intersects the 
inner disk of each annulus in A. Note that it is allowed that one annulus is "inside" of 
another annulus. 




Figure X.5: An example of an annulus structure A := {^i, A2, A^} compatible with 
a spectral sample 

The mapping procedure in [GPSlOj assigns to each ^(r) an annulus structure A C 
[0,n]^ in such a way that it is compatible with S^(r)- See Figure X.5 for an example. 



Again, we will not describe this procedure nor discuss the obvious boundary issues 
which arise here, but let us state a crucial property satisfied by annulus structures. 



Lemma X.14 (clustering Lemma). If A is an annulus structure contained in [0, n]"^, 
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then 

P[^(r) is compatible with A] < Y\ a4(^)^ , 

where 04 (A) denotes the probability of having a four-arm event in the annulus A. 

Remark X.7. To deal with boundary issues, one would also need to incorporate within 
our annulus structures half-annuli centered on the boundaries as well as quarter disks 
centered at the corners of [0,n]^. 

Let us briefly comment on this lemma. 

• First of all, its proof is an elegant combination of linear algebra and percolation. 
It is a short and relatively elementary argument. See Lemma 4.3 in |GPS10] . 

• It is very powerful in dealing with the possible non-injectivity of the mapping 
iS(j.) H- A. Indeed, while describing the setup above, one might have objected that 
if the mapping were not injective enough, then the cardinality of the "fibers" above 
each annulus structure would have to be taken into account as well. Fortunately, 
the above lemma reads as follows: for any fixed annulus structure A, 



^ P[^{r)] < P[^{r) is compatible with ^] < JJ "4(^)^ • 

y(ry-y(r)^A AeA 

Another essential feature of this lemma is that it quantifies very efficiently the fact 



that the clustering effect wins over the entropy effect in the sense of Figure X.4 
The mechanism responsible for this is that the probability of the four-arm event 
squared has an exponent (equal to 5/2 on T) larger than the volume exponent 
equal to 2. To illustrate this, let us analyze the situation when k = 2 (still 
neglecting boundary issues). The probability that the spectrum intersects two 
and only two r-squares at macroscopic distance Q{n) from each other can be easily 
estimated using the lemma. Indeed, in such a case, y^r) would be compatible with 
an annulus structure consisting of two annuli, each being approximately of the 
type A{r,n). There are O(^) x O(^) such possible annulus structures. Using 
the lemma each of them costs (on T) An easy exercise shows that this 

is much smaller than P[|.5^(r)| = 2] . In other words, if \^(r)\ is conditioned to be 
small, it tends to be localized. Also, the way that the lemma is stated makes it 
very convenient to work with higher values of k. 



The details of the proof of Theorem X.13 can be found in |GPS10j . The double 



induction there is in some sense very close to the one we carried out in detail in sub- 



section 3^ in the case of the tree; this is the reason why we included this latter proof. 
For those who might read the proof in |GPS10j . there is a notion of overcrowded cluster 
defined there; it exactly corresponds in the case of the tree to stopping the analysis 
above scale Jk instead of going all the way to scale 1 (note that without stopping at 



this scale Jk, the double induction in subsection 3.4 would have failed). 
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4.4 Some words on the weak independence property proved 
in [IGPSlOl 

This part of the program is in some sense the main one. To introduce it, let us start by 



a naive but tempting strategy. What the first part of the program (Theorem X. 13 ) tells 
us is that for any mesoscopic scale 1 < r < n, if S^f^ is non-empty, it is very unlikely 
that it will intersect few squares of size r. In other words, it is very unlikely that |t5^(r)| 
will be small. Let Bi, . . . ,Bm denote the set of 0{n^/r^) r-squares which tile [0, n]^. 
One might try the following scanning procedure: explore the spectral sample =5^/„ inside 
the squares Bi one at a time. More precisely, before starting the scanning procedure, 
we consider our spectral sample as a random subset of [0,?7,]^ about which we do 
not know anything yet. Then, at step one, we reveal ^\Bi- This gives us some partial 



information about ^f„- What we still have to explore is a random set of [0,n] \ Bi 
which follows the law of a spectral sample conditioned on what was seen in Bi and we 



keep going in this way. By Theorem X.13 many of these squares will be non-empty. 



Now, it is not hard to prove the following lemma (using similar methods as in Problem 



X.6). 



Lemma X.15. There is a universal constant c > such that for any r -square B in the 
bulk [72/4,872/4]^, one has 

P[|^/„ n 5| > cr^a^ir) \ y^^^nB ^(/}] > c. 

This lemma in fact holds uniformly in the position of B inside [0,n]^. 
If one could prove the following (much) stronger result: there exists a universal 
constant c > such that uniformly on the sets S G [0, n]^ \ B one has 



P[|^/„ nB\> cr^a^ir) \ n 7^ and = S] >c, 



(X.12) 



then it would not be hard to make the above scanning strategy work together with 
Theorem X.13 in order to obtain Theorem X.3 (Note that such a result would indeed 



give a strong hint of independence within ^f„-) However, as we discussed before, 
the current understanding of the independence within is far from giving such a 
statement. Instead, the following result is proved in |GPS10j . We provide here a 
slightly simplified version. 



Theorem X.16 ( |GPS10] ). There exists a uniform constant c > such that for any 
set W C [0, n]^ and any r -square B such that B nW = ^, one has 



P[|^/„ n fi| > cr'^a4{r) \ H B ^ (Is and H W = fll] > c. 



Note that this theorem in some sense interpolates between part of Lemma X.IO and 
Lemma X.15 which correspond respectively to the special cases W = B'^ and W = ^. 
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Yet it looks very weak compared to the expected (X.12) which is stated uniformly on 
the behavior of outside of B. 

Assuming this weak hint of independence (Theorem X.16), it seems we are in bad 
shape if we try to apply the above scanning procedure. Indeed, we face the following 
two obstacles: 



1. The first obstacle is that one would keep a good control only as far as one would 
not see any "spectrum". Namely, while revealing one at a time, the first 
time one finds a square Bi such that 7^ 0, one would be forced to stop the 
scanning procedure there. In particular, if the size of the spectrum in this first 
non-trivial square does not exceed r^a4(r), then we cannot conclude anything. 

2. The second obstacle is that, besides the conditioning y H W = (/i, our estimate 
is also conditioned on the event that ^5^ fl i? 7^ 0. In particular, in the above 
"naive" scanning strategy where squares are revealed in a sequential way, at each 
step one would have to update the probabihty that fl Bj+i 7^ based on what 
was discovered so far. 



It is the purpose of the third part of the program to adapt the above scanning 
strategy to these constraints. Before describing this third part in the next subsection. 



let us say a few words on how to prove Theorem X.16 



A crucial step in the proof of this theorem is to understand the following "one-point 
function" for any x G -B at distance at least r/3 from the boundary: 

F[x G and y^^nW = 0\. 

A very useful observation is to rewrite this one-point function in terms of an explicit 
coupling of two i.i.d. percolation configurations. It works as follows: let (001,002) be a 
coupling of two i.i.d. percolations on [0,n]^ which are such that 

ooi = 002 on 

001,002 are independent on W 

One can check that the one-point function we are interested in is related to this coupling 
in the following simple way: 

P[x G and y^^nW = (ls] =F[x is pivotal for ooi AND ^2] • 

Remark X.8. You may check this identity in the special cases where W = ^ or W = 

{xy. 



Thanks to this observation, the proof of Theorem |X.16| proceeds by analyzing this 
ly-coupling. See |GPS10j for the complete details. 
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4.5 Adapting the setup to the weak hint of independence 



As we discussed in the previous subsection, one faces two main obstacles if, on the 



basis of the weak independence given by Theorem X.16 one tries to apply the naive 
sequential scanning procedure described earlier. 

Let us start with the first obstacle. Assume that we scan the domain [0, n]^ in a se- 
quential way, i.e., we choose an increasing family of subsets (PF/)/>i = {{wi, . . . , wi})i>i. 
At each step, we reveal what is, conditioned on what was discovered so far (i.e.. 



conditioned on oJ^vi/J- From the weak independence Theorem X.16, it is clear that if 



we want this strategy to have any chance to be successful, we have to choose {Wi)i>i 
in such a way that {^f„ H Wi)i>i will remain empty for some time (so that we can 
continue to rely on our weak independence result); of course this cannot remain empty 
forever, so the game is to choose the increasing family {Wi)i>i in such a way that the 
first time fl {wi} will happen to be non-empty, it should give a strong indication 
that S^f^^ is large in the r-neighborhood of Wi. 

As we have seen, revealing the entire mesoscopic boxes Bi one at a time is not a 
successful idea. Here is a much better idea (which is not yet the right one due to the 
second obstacle, but we are getting close): in each r-square Bi, instead of revealing all 



the bits, let us reveal only a very small proportion 6r of them. Lemma |X.15| tells us 
that if =5^ n 7^ 0, then each point x E Bi has probability of order 04 (r) to be in yf„. 
Therefore if we choose 6r -C (r^a4(r))~^, then with high probability, by revealing only 
a proportion 6r of the points in Bi, we will "miss" the spectral sample o$^/„. Hence, we 
have to choose 6r > (r^a4(r))~^. In fact choosing 6 x (r^a;4(r))~^ is exactly the right 



balance. Indeed, we know from Theorem X.13 that many r-squares Bi will be touched 
by the spectral sample; now, in this more sophisticated scanning procedure, if the first 
such square encountered happens to contain few points (i.e. ^ r'^a^i^r)), then with 
the previous scanning strategy, we would "lose", but with the present one, due to our 
choice of 6r, most likely we will keep S^f^ fl PV/ = so that we can continue further on 
until we reach a "good" square (i.e. a square containing of order r^a4(r) points). 



Now, Theorems X.13 and X.16 together tell us that with high probability, one will 
eventually reach such a good square. Indeed, suppose the m first r-squares touched 
by the spectral sample happened to contain few points; then, most likely, if Wi^ is the 
set of bits revealed so far, by our choice of 6r we will still have y fl Wi^ = 0. This 



allows us to still rely on Theorem |X.16[ which basically tells us that there is a positive 
conditional probability for the next one to be a "good" square (we are neglecting the 
second obstacle here). This says that the probability to visit m consecutive bad squares 



seems to decrease exponentially fast. Since m is typically very large (by Theorem X.13 ), 
we conclude that, with high probability, we will finally reach good squares. In the first 
good square encountered, by our choice of 6r, there is now a positive probability to 
reveal a bit present in In this case, the sequential scanning will have to stop, since 
we will not be able to use our weak independence result anymore, but this is not a big 
issue: indeed, assume that you have some random set S G B. If by revealing each bit 
only with probability 6r, you end up finding a point in S, most likely your set S is at 
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least of size Q{r'^a4{r)). This is exactly the size we are looking for in Theorem X.3 

Now, only the second obstacle remains. It can be rephrased as follows: assume 
you applied the above strategy in Bi, . . . ,Bh (i.e. you revealed each point in Bi, i G 
{1,. . . ,h} only with probability 5,.) and that you did not find any spectrum yet. In 
other words, if Wi denotes the set of points visited so far, then nWi = 0. Now if 
Bh^i is the next r-square to be scanned (still in a "dilute" way with intensity 6r), we 
seem to be in good shape since we know how to control the conditioning S^f^ CiWi = ^. 



However, if we want to rely on the uniform control given by Theorem X.16 , we also 
need to further condition on fl Bh+i 7^ 0. In other words, we need to control the 
following conditional expectation: 

It is quite involved to estimate such quantities. Fortunately, by changing our sequential 
scanning procedure into a slightly more "abstract" procedure, one can avoid dealing 
with such terms. More precisely, within each r-square B, we will still reveal only a 6r 
proportion of the bits (so that the first obstacle is still taken care of), but instead of 
operating in a sequential way (i.e. scanning Bi, then B2 and so on), we will gain a lot 



by considering the combination of Theorem X.13 and Theorem X.16 in a more abstract 



fashion. Namely, the following large deviation lemma from |GPS10j captures exactly 
what we need in our present situation. 



Lemma X.17 ( |GPS1 0]). Let Xi,Yi G {0, 1}, i G {1, . . . ,m} be random variables such 
that for each i Yi < Xi a.s. If^J C [m] and Vi G [m] \ J , we have 

p[y. = 1 I y. = 0, Vj G J] > c P[Xi = 1 I Yj = 0, Vj G J] , (X.13) 

then if X := "^Xi and Y := "^Yi, one has that 

F[Y = I X > 0] < c-^E[e"("/^)^ | X > O] . 

Recall that Bi, . . . , Bm denotes the set of r-squares which tile [0,n]^. For each 
i G [m], let Xi := l^f^g.^^j^ and Y^ := l,yr\Bir\Wi^%i where W is an independent uniform 
random subset of [0, ra]^ of intensity 5r- 

This lemma enables us to combine our two main results. Theorems X.16| and X.13 



in a very nice way: By our choice of the intensity 5^, Theorem X.16 exactly states 



that the assumption (X.13) is satisfied for a certain constant c > 0. Lemma X.17 then 
implies that 



P[F = I X > 0] < c"^E[e"("/^)^ I X > 0] . 

Now, notice that X = ^Xj exactly corresponds to \y{r)\ while the event {X > 0} 
corresponds to 7^ 0} and the event {F = 0} corresponds to fl W = 0}. 

Therefore Theorem IX. 131 leads us to 
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P[^/„ n W = , 7^ 0] < c-iE[e-(^/^)l^('-)l , ^ 0] 



k>l 



k>l 

<Cie)F[\y^r)\ = l] x-a4(r,n) 



r-)l 



1] 



(X.14) 



where (X.IO) is used in the last step. 

This shows that on the event that S^f^^ ^ 0, it is very unhkely that we do not detect 
the spectral sample on the (5r-dilute set W. This is enough for us to conclude using the 
following identity: 



p[^/„ n w = I yf„] = (1 - = (1 



Indeed, by averaging this identity we obtain 



1 



r^a4(r) ' 



]\-y'fn\ 



P [Yf^ n W = , ^ 0] = E [P [Yf^ n W = I Yf^] ly^.^^,] 

> ^](1)P[0 < l^/J < r'^ai[ 



which, combined with (X.14) yields the desired upper bound in Theorem X.3 See 
Problem IX. 71 for the lower bound. 



5 The radial case 

The next chapter will focus on the existence of exceptional times in the model of dy- 
namical percolation. A main tool in the study of these exceptional times is the spectral 
measure Q^,^ where gpc is the Boolean function := { — 1, l}'^^^^) — > {0, 1} defined 
to be the indicator function of the one-arm event {0 < — > dB{0, R)}. Note that by 
definition, gn is such that WgnWl = ai(-R). 

In |GPS10j . the following "sharp" theorem on the lower tail of ^5^^^ is proved. 

Theorem X.18 ( |GPS10j ). Let g^ he the one-arm event in B{0,R). Then for any 
1 < r < R, one has 

Q,JO < |^,J < r'a,{r)] x . (X.15) 
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X.3) 



The proof of this theorem is in many ways similar to the chordal case (Theorem 
An essential difference is that the "clustering v.s. entropy" mechanism is very 



different in this case. Indeed in the chordal left to right case, when is conditioned 
to be very small, the proof of Theorem X.3 shows that typically S^f^ localizes in some 
r-square whose location is "uniform" in the domain [0,n]^. In the radial case, the 
situation is very different: conditioned to be very small will in fact tend to localize 
in the r-square centered at the origin. This means that the analysis of the mesoscopic 
behavior (i.e. the analogue of Theorem X.13) has to be adapted to the radial case. In 
particular, in the definition of an annulus structure, the annuli containing the origin 
play a distinguished role. See |GPS10j for complete details. 
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Exercise sheet on chapter 



X 



Exercise X.l. Prove Proposition |X.4 



Exercise X.2. Consider tlie fractal percolation process T*, i > 1 introduced in this 
cliapter. (Recall that = T*). Recall that in Section [sj it was important to estimate 
the quantity P[|T*| = l]. This is one of the purposes of the present exercise. 

l] . By recursion, show that there is a constant c G (0,1) so 



(a) Let Pi := F[\V 
that, as i — )■ oo 

Pi ~ Cfl' , 

where /i := 4p(l — p + pqf" and q is the probability of extinction for the Galton- 
Watson tree correponding to (T*)i>i. 

(b) Using the generating function s ^ /(s)(= E{s number of offspring^ ^^^^^^ Galton- 
Watson tree, and by studying the behavior of its i-th. iterates f^'^\ prove the same 
result with /i := f'{q). Check that it gives the same formula. 



(c) Recall the definition of Pm,b from Section |3| Let Pm,oo be the probability that 
exactly 1 person at generation m survives forever. Prove that 



Pn 



for the same exponent /x. Prove Lemma X.9 Finally, prove that lim;,^ooPm,6 

Pm,oo ■ 



Exercise X.3. Extract from the proof of Lemma X.8 the answer to the question asked 



in Figure X.4 



Exercise X.4. Prove that 

Theorem IX. 31 =^ Theorem IX. 21 =^ Theorem IX. II 
Exercise X.5. Consider an r-square B C [n/4, 3n/4]^ in the "bulk" of [0,n]^. 



(a) Prove using Proposition IX. 3 that 



^ and C 5] X a^ir, nf 
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(b) Check that the clustering Lemma X.14 is consistent with this estimate 



Problem X.6. The purpose of this exercise is to prove Lemma X.IO 



(a) Using Proposition IX. 3, prove that for any x E B at distance r/3 from the bound- 
ary, 

P[x G and n 5^ = 0] x a4(r)a4(r, . 



(b) Recover the same result using Proposition IX. 4 instead. 



(c) Conclude using Exercise X.5 that E[|^/„ f] B\ | ^ and C fi] x r'^a4{r) 
where B G B is the set of points a; G -B at distance at least r/3 from the boundary. 

(d) Study the second-moment ]E[|^/„ fl i?p | 7^ and C B] . 



(e) Deduce Lemma X.IO 



Problem X.7. Most of this chapter was devoted to the explanation of the proof of 



Theorem |X.3[ Note that we in fact only discussed how to prove the upper bound. 
This is because the lower bound is much easier to prove and this is the purpose of this 
problem. 



(a) Deduce from Lemma 



X.IO 



and Exercise 



X.5 



[a) that the lower bound on P[0 < < r^Q;4( 



given in Theorem X.3 is correct. I.e., show that there exists a constant c > such 
that 



P[0 < \yfj < r^ai{r)] > c^ai{r,n) 



(b) (Hard) In the same fashion, prove the lower bound part of Theorem 



X.18 



Chapter XI 



Applications 
percolation 



to dynamical 



In this section, we present a very natural model where percolation undergoes a time- 
evolution: this is the model of dynamical percolation described below. The study 
of the "dynamical" behavior of percolation as opposed to its "static" behavior turns 
out to be very rich: interesting phenomena arise especially at the phase transition 
point. We will see that in some sense, dynamical planar percolation at criticality is a 
very unstable or chaotic process. In order to understand this instability, sensitivity of 
percolation (and therefore its Fourier analysis) will play a key role. In fact, the original 
motivation for the paper |BKS99] on noise sensitivity was to solve a particular problem 
in the subject of dynamical percolation. |Ste09j provides a recent survey on the subject 
of dynamical percolation. 

We mention that one can read all but the last section of the present chapter without 
having read Chapter |X} 

1 The model of dynamical percolation 

This model was introduced by Haggstrom, Peres and Steif |HPS97j inspired by a ques- 
tion that Paul Malliavin asked at a lecture at the Mittag-LefHer Institute in 1995. This 
model was invented independently by Itai Benjamini. 

In the general version of this model as it was introduced, given an arbitrary graph 
G and a parameter p, the edges of G switch back and forth according to independent 
2-state continuous time Markov chains where closed switches to open at rate p and 
open switches to closed at rate 1 — p. Clearly, the product measure with density p, 
denoted by vr^ in this chapter, is the unique stationary distribution for this Markov 
process. The general question studied in dynamical percolation is whether, when we 
start with the stationary distribution tt^, there exist atypical times at which the perco- 
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lation structure looks markedly different than that at a fixed time. In almost all cases, 
the term "markedly different" refers to the existence or nonexistence of an infinite con- 
nected component. Dynamical percolation on site percolation models, which includes 
our most important case of the hexagonal lattice, is defined analogously. 

We very briefly summarize a few early results in the area. It was shown in |HPS97j 
that below criticality, there are no times at which there is an infinite cluster and above 
criticality, there is an infinite cluster at all times. See the exercises. In [ HPS97j . exam- 
ples of graphs which do not percolate at criticality but for which there exist exceptional 
times where percolation occurs were given. (Also given were examples of graphs which 
do percolate at criticality but for which there exist exceptional times where percolation 
does not occur.) A fairly refined analysis of the case of so-called spherically symmetric 
trees was given. See the exercises for some of these. 

Given the above results, it is natural to ask what happens on the standard graphs 
that we work with. Recall that for Z^, we have seen that there is no percolation at 
criticality. It turns out that it is also known (see below) that for d > 19, there is no 
percolation at criticality for Z'^. It is a major open question to prove that this is also 
the case for intermediate dimensions; the consensus is that this should be the case. 



2 What's going on in high dimensions: Z^, (i > 19? 

For the high dimensional case, Z*^, d > 19, it was shown in |HPS97] that there are no 
exceptional times of percolation at criticality. 

Theorem XI. 1 ( |HPS97j ). For the integer lattice with d > 19, dynamical critical 
percolation has no exceptional times of percolation. 

The key reason for this is a highly nontrivial result due to work of Hara and Slade 
( |HS94j ). using earlier work of Barsky and Aizenman ( jBA91] ). that says that if 6{p) is 
the probability that the origin percolates when the parameter is p, then for p > Pc 

e{p) = 0{p - Pc) . (xi.i) 

(This implies in particular that there is no percolation at criticality.) In fact, this 
is the only thing which is used in the proof and hence the result holds whenever the 
percolation function satisfies this "finite derivative condition" at the critical point. 

Outline of Proof. By countable additivity, it suffices to show that there are no times 
at which the origin percolates during [0,1]. We use a first moment argument. We 
break the time interval [0, 1] into m intervals each of length 1/m. If we fix one of these 
intervals, the set of edges which are open at some time during this interval is i.i.d. with 
density about pc + 1/m. Hence the probability that the origin percolates with respect 



to these set of edges is by (XI.I) at most 0(l/m). It follows that if A^^ is the number 
of intervals where this occurs, then E[Am] is at most 0(1). It is not hard to check that 
A^ < liminfm A^m, where A^ is the cardinality of the set of times during [0, 1] at which 
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the origin percolates. Fatou's Lemma now yields that E(A^) < oo and hence there are 
at most finitely many exceptional times during [0, 1] at which the origin percolates. To 
go from here to having no exceptional times can either be done by using some rather 
abstract Markov process theory or by a more hands on approach as was done in jHPS97j 
and which we refer to for details. □ 



Remark XI. 1. It is known that (XI.l) holds for any homogeneous tree (see |Gri99] for 
the binary tree case) and hence there are no exceptional times of percolation in this 
case also. 



Remark XI. 2. It is was proved by Kesten and Zhang |KZ87j . that (XI.l ) fails for and 
hence the proof method above to show that there are no exceptional times fails. This 
infinite derivative in this case might suggest that there are in fact exceptional times for 
critical dynamical percolation on 7?, an important question left open in |HPS97] . 



3 d = 2 and BKS 

One of the questions posed in |HPS97j was whether there are exceptional times of 
percolation for Z^. It was this question which was one of the main motivations for 
the paper |BKS99j . While they did not prove the existence of exceptional times of 
percolation, they did obtain the following very interesting result which has a very 
similar fiavor. 

Theorem XI. 2 ( |BKS99] ). Consider an Rx R box on which we run critical dynamical 
percolation. Let Sr be the number of times during [0, 1] at which the configuration 
changes from having a percolation crossing to not having one. Then 

Sr ^ oo in probability as R oo. 



Noise sensitivity of percolation as well as the above theorem tells us that certain large 
scale connectivity properties decorrelate very quickly. This suggests that in some vague 
sense uf" "changes" very quickly as time goes on and hence there might be some chance 
that an infinite cluster appears since we are given many "chances" . 

In the next section, we begin our study of exceptional times for Z^ and the hexagonal 
lattice. 



4 The second moment method and the spectrum 

In this section, we reduce the question of exceptional times to a "second moment 
method" computation which in turn reduces to questions concerning the spectral be- 
havior for specific Boolean functions involving percolation. Since p = 1/2, our dynamics 
can be equivalently defined by having each edge or hexagon be rerandomized at rate 1. 
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The key random variable which one needs to look at is 



• 1 

X = Xr = I 1„ cu, „dt 







where R is of course the event that at time t there is an open path from the 
origin to distance R away. Note that the above integral is simply the Lebesgue measure 
of the set of times in [0, 1] at which this occurs. 

We want to apply the second moment method here. We isolate the easy part of the 
argument so that the reader who is not familiar with this method understands it in a 
more general context. However, the reader should keep in mind that the difficult part is 
always to prove the needed bound on the second moments which in this case is (XI. 2). 

Proposition XI. 3. // there exists a constant C such that for all R 

nxi) < cnxR)\ (XI.2) 

then a.s. there are exceptional times of percolation. 

Proof. For any nonnegative random variable F , the Cauchy-Schwarz inequality applied 
to F/{y>o} yields 

p(r > 0) > e(f)Ve(f^). 



Hence by (XI.2), we have that for all i?, 

n^R > 0) > i/c 

and hence by countable additivity (as we have a decreasing sequence of events) 

n^R{XR > 0}) > i/c. 

Had the set of times that a fixed edge is on been a closed set, then the above would 
have yielded by compactness that there is an exceptional time of percolation with 
probability at least 1/C. However, this is not a closed set. On the other hand, this 
point is very easily fixed by modifying the process so that the times each edge is on is 
a closed set and observing that a.s. no new times of percolation are introduced by this 
modification. The details are left to the reader. Once we have an exceptional time with 
positive probability, ergodicity immediately implies that this occurs a.s. □ 

The first moment of Xr is, due to Fubini's Theorem, simply the probability of our 
one-arm event, namely ai{R). The second moment of Xr is easily seen to be 

E{X^) = E{[ [ lQ^jil^^j^dsdt)= [ [ F{0 R,0 R)dsdt (XI.3) 
Jo Jo ^ ^ ^ ^ Jo Jo 

which is, by time invariance, at most 

i^R,Oi^R) ds. (XI.4) 
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The key observation now, which brings us back to noise sensitivity, is that the 
integrand P(0 R, R) is precisely E,[fji{u)fR{ue)] where fn is the indicator of 
the event that there is an open path from the origin to distance R away and e = 1 — 
since looking at our process at two different times is exactly looking at a configuration 
and a noisy version. 

What we have seen in this subsection is that proving the existence of exceptional 
times comes down to proving a second moment estimate and furthermore that the 
integrand in this second moment estimate concerns noise sensitivity, something for 
which we have already developed a fair number of tools to handle. 

5 Proof of existence of exceptional times for the 
hexagonal lattice via randomized algorithms 

In |SS10b] . exceptional times were shown to exist for the hexagonal lattice; this was 
the first transitive graph for which such a result was obtained. However, the methods 
in this paper did not allow the authors to prove that had exceptional times. 

Theorem XI. 4 ( |SS10b] ). For dynamical percolation on the hexagonal lattice T at the 
critical point Pc = 1/2, there exist almost surely exceptional times t G [0, oo) such that 
LOt has an infinite cluster. 



Proof. As we noted in the previous section, two different times of our model can be 
viewed as "noising" where the probability that a hexagon is rerandomized within t units 



of time is 1 — e . Hence, by (IV.2), we have that 



%^SQB(S),R) 

where -8(0, R) are the set of hexagons involved in the event /r. We see in this expression 
that, for small times t, the frequencies contributing in the correlation between {0 
R} and {0 ^— ^ R} are of "small" size \S\ < 1/t. Therefore, in order to detect the 
existence of exceptional times, one needs to achieve good control on the lower tail of 
the Fourier spectrum of fR. 

The approach of this section is to find an algorithm minimizing the revealment as 



much as possible and to apply Theorem VIII. 1 However there is a difficulty here, since 



our algorithm might have to look near the origin, in which case it is difficult to keep the 
revealment small. There are other reasons for a potential problem. If R is very large 
and t very small, then if one conditions on the event {0 ^ i?}, since few sites are 
updated, the open path in uq from to distance R will still be preserved in ujt at least 
up to some distance L{t) (further away, large scale connections start to decorrelate). 
In some sense the geometry associated to the event {0 R} is "frozen" on a certain 
scale between time and time t. Therefore, it is natural to divide our correlation 
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analysis into two scales: the ball of radius r = r{t) and the annulus from r(t) to R. 
Obviously the "frozen radius" r = r{t) increases as t — > 0. We therefore proceed as 
follows instead. For any r, we have 



P[0 4^i?, Oi^i?] < P[0 4^ r] P[r ^i?, r^i?] 

< ai(r)E[/,.ij(wo)/r,/?(wt)], 



(XI.6) 



where fr^R is the indicator function of the event, denoted by r i?, that there is an 
open path from distance r away to distance R away. Now, as above, we have 



^[fr,RMfr,R{^t)\ < E + ^ exp(-tfc) fr,R{Sf 

\S\=k 



(XI.7) 



fc=i 



The Boolean function fr^R somehow avoids the singularity at the origin, and it is 
possible to find algorithms for this function with small revealments. In any case, letting 
^ = ^r,i? be the revealment of fr,R, it follows from Theorem VIII. 1 and the fact that 

A;exp(-tfc) < 0{l)/t^ that 



^kRMfrA^t)] < a^{r,Rf + 0{l)5a^{r,R)/t\ 



(XI.8) 



The following proposition gives a bound on 5. We will sketch why it is true after- 
wards. 



Proposition XI.5 f |SS10bj ). Let 2 < r < R. Then 

Sr,R<Oil)ai{r, R) «2(r) . 



(XI.9) 



Putting together (XI.6), (XI.8), Proposition XI.5 and using quasi-multiplicativity 
of «! yields 

ai{Ry f a2(r)~ 



P[0^i?, O^i?] <0(1)- 



aAr) 



1 + 



t2 



This is true for all r and t. If we choose r = r(t) = and ignore o(l) terms in 

the critical exponents (which can easily be handled rigorously), we obtain, using the 
explicit values for the one and two-arm critical exponents, that 



P[0 ^ i?, 4^ i?] < 0(l)t-^/6«i(i?)2 . 



(XI.IO) 



Now, since t ^/^dt < oo, by integrating the above correlation bound over the unit 
interval, one obtains that EfX^] < CE[X^] for some constant C as desired. □ 



Outline of proof of Proposition XI.5 



We use an algorithm that mimics the one we used for percolation crossings except the 
present setup is "radial". As in the chordal case, we randomize the starting point of 
our exploration process by choosing a site uniformly on the 'circle' of radius R. Then, 
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we explore the picture with an exploration path 7 directed towards the origin; this 
means that as in the case of crossings, when the interface encounters an open (resp. 
closed) site, it turns say to the left (resp. right), the only difference being that when 
the exploration path closes a loop around the origin, it continues its exploration inside 
the connected component of the origin. (It is known that this discrete curve converges 
towards radial SLEg on T, when the mesh goes to zero.) It turns out that the so- 
defined exploration path gives all the information we need. Indeed, if the exploration 
path closes a clockwise loop around the origin, this means that there is a closed circuit 
around the origin making fr^R equal to zero. On the other hand, if the exploration 
path does not close any clockwise loop until it reaches radius r, it means that fr,R = 1. 
Hence, we run the exploration path until either it closes a clockwise loop or it reaches 
radius r. This is our algorithm. Neglecting boundary issues (points near radius r or R), 
if a; is a point at distance u from 0, with 2r < u < R/2, in order for x to be examined 
by the algorithm, it is needed that there is an open path from 2u to R and the two-arm 
event holds in the ball centered at u with radius u/2. Hence for |x| = u, F[x e J] is at 
most 0{l)a2{u)ai{u, R). Due to the explicit values of the one and two-arm exponents, 
this expression is decreasing in u. Hence, ignoring the boundary, the revealment is at 
most 0{l)a2{r)ai{r, R) . See |SS10bj for more details. □ 

We now assume that the reader is familiar with the notion of Hausdorff dimen- 
sion. We let S C [0, 00] denote the (random) set of these exceptional times at which 
percolation occurs. It is an immediate consequence of Fubini's Theorem that S has 
Lebesgue measure zero and hence we should look at its Hausdorff dimension if we want 
to measure its "size" . The first result is the following. 



Theorem XI. 6 ( |SS10b] ). The Hausdorff dimension of S is an almost sure constant 
m [1/6,31/36]. 
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It was conjectured there that the dimension of the set of exceptional times is a.s. 
31/36. 

Outline of Proof. The fact that the dimension is an almost sure constant follows 
from easy 0-1 Laws. The lower bounds are obtained by placing a random measure on 
S with finite so-called a-energies for any a < 1/6 and using a result called Frostman's 
Theorem. (This is a standard technique once one has good control of the correlation 
structure.) Basically, the 1/6 comes from the fact that for any a < 1/6, one can 
multiply the integrand in J^t~^^^dt by (1/t)" and still be integrable. It is the amount 
of "room to spare" you have. If one could obtain better estimates on the correlations, 
one could thereby improve the lower bounds on the dimension. The upper bound is 
obtained via a first moment argument similar to the proof of Theorem XI. 1 but now 
using (II.l ). □ 

Before moving on to our final method of dealing with the spectrum, let us consider 
what we might have lost in the above argument. Using the above argument, we op- 
timized things by taking r{t) = [l/t)^. However, at time t compared to time 0, we 
have noise which is about t. Since we now know the exact noise sensitivity exponent, 
in order to obtain decorrelation, the noise level should be at least about the negative 
3/4th power of the radius of the region we are looking at. So, events in our annu- 
lus should decorrelate if r(t) >> (l/t)^^^. This suggests there might be potential for 
improvement. Note we used an inner radius which is 6 times larger than potentially 
necessary (8 = 6x4/3). This 6 is the same 6 by which the result in Theorem VIII. 4 
differed by the true exponent (3/4 = 6 x 1/8) and the same 6 explaining the gap in 
Theorem XI. 6| (1 — 1/6) = 6x (1 — 31/36). This last difference is also seen by comparing 
the exponents in (XI. 10) and the last term in (XI. 11) below. 



6 Proof of existence of exceptional times via the 
geometric approach of the spectrum 

Recall that our third approach for proving the noise sensitivity of percolation crossings 
was based on a geometrical analysis of the spectrum, viewing the spectrum as a random 
set. This approach yielded the exact noise sensitivity exponent for percolation crossings 
for the hexagonal lattice. This approach can also be used here as we will now explain. 
Two big advantages of this approach are that it succeeded in proving the existence 
of exceptional times for percolation crossings on Z^, something which |SS10b] was not 
able to do, as well as obtaining the exact Hausdorff dimension for the set of exceptional 
times, namely the upper bound of 31/36 in the previous result. 

Theorem XI. 7 ( |GPS10j ). For the triangular lattice, the Hausdorff' dimension of S is 
almost surely 31/36. 



Proof. As explained in the previous section, it suffices to lower the 5/6 in (XI. 10) to 



5/36. (Note that (XI. 10) was really only obtained for numbers strictly larger than 5/6, 
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with the 0(1) depending on this number; the same will be true for the 5/36.) 

Let s(r) be the inverse of the map r — )■ r^a4(r) ~ r^/^. So more or less, s(r) := r^/^. 
Using Theorem X.18 , we obtain the following: 

E[/«(a;o)//?K)] = ^exp(-t|5|)/^(5)2 

s 

oo 

= E E exp(-t|5|)/«(5)2 

fc=l 5;|5|e[(fc-l)/t,A:/t) 
oo 

< J]exp(-A:)Q[|^/J<A;/t] 



fc=i 

oo 



< 



< 



ai(s(A;A)) 
t 



0(l)^exp(-fc) 



0(l)ai(i?)2 5^exp(-/t)(-^^/3x5/48 



fc=i 

< 0(l)ai(i?)2(^)^/3^ (XI.ll) 

This completes the proof. (Of course, there are o(l) terms in these exponents which we 
are ignoring.) □ 

We have done a lot of the work for proving that there are exceptional times also on 

Theorem XI. 8 ( |GPS10] ). For dynamical percolation on 7? at the critical point pc = 
1/2, there exist almost surely exceptional times t G [0, cxd) such that Ut has an infinite 
cluster. 

Proof. s(r) is defined as it was before but now we cannot say that s(r) is about r^^^. 
However, we can say that for some fixed 5 > 0, we have that for all r, 

s(r) > (XI.12) 

From the previous proof, we still have 

— < O 1 > exp -fc XI.13 

^ ai{s{k t)) 



Exactly as in the proof of Theorem XI.4 we need to show that the right hand side 
is integrable near in order to carry out the second moment argument. 
Quasi-multiplicativity can be used to show that 



a 



i(s(l/t)) < k^^'^ai{s{k/t)). (XI.14) 
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(Note that if things behaved exactly as power laws, this would be clear. 
Therefore the above sum is at most 



fc=i 



aMl/t)) 



<0(1) 



1 



V. Beffara has shown that there exists eo > such that for all r, 



ai(r)a4(r) > r 



eo-2 



(XI.15) 



(XI.16) 



Note that Theorem VI. 4 and (VI. 7) tell us that the left hand side is larger than 
fi(l)r~^. The above tells us that we get an (important) extra power of r in (VI. 7). 
It follows that 



1 



aMl/t)) 



< a4(s(l/t))s(l/t) 



2-eo 



;iA).(i/t)- 



(XI.17) 



(XI. 12) tells us that the last factor is at most f for some r] > and hence the 
relevant integral converges as desired. The rest of the argument is the same. □ 

One can also consider exceptional times for other events, such as for example times 
at which there is an infinite cluster in the upper half-plane or times at which there 
are two infinite clusters in the whole plane, and consider the corresponding Hausdorff 
dimension. A number of results of this type, which are not sharp, are given in |SS10bj 
while various sharp results are given in [GPSlOj . 



Exercise sheet of Chapter XI 



Exercise XI. 1. Prove that on any graph below criticahty, there are no times at which 
there is an infinite cluster while above criticality, there is an infinite cluster at all times. 

Exercise XI. 2. Consider critical dynamical percolation on a general graph satisfying 
0{pc) = 0. Show that a.s. {t : Ut percolates } has Lebesgue measure 0. 

Exercise XI. 3. (Somewhat hard). A spherically symmetric tree is one where all ver- 
tices at a given level have the same number of children, although this number may 
depend on the given level. Let T„ be the number of vertices at the nth level. Show that 
there is percolation at p if 



Hint: Let X„ be the number of vertices in the nth level which are connected to the 
root. Apply the second moment method to the sequence of X„'s. 

The convergence of the sum is also necessary for percolation but this is harder and 
you are not asked to show this. This theorem is due to Russell Lyons. 

Exercise XI. 4. Show that if T„ is n^2"' up to multiplicative constants, then the critical 
value of the graph is 1/2 and we percolate at the critical value. (This yields a graph 
which percolates at the critical value.) 

Exercise XI. 5. (Quite a bit harder). Consider dynamical percolation on a spherically 
symmetric tree. Show that there for the parameter p, there are exceptional times at 
which percolation occurs if 



Hint: Find an appropriate random variable X„ to which the second moment method 
can be applied. 

Exercise XI. 6. Find a spherically symmetric tree which does not percolate at criti- 
cality but for which there are exceptional times at which percolation occurs. 





n 
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